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HUMAN TRANSCRIPTIONAL REGULATOR PROTEINS 



TECHNICAL FIELD 

This invention relates to nucleic acid and amino acid sequences of human transcriptional 
5 regulator proteins and to the use of these sequences in the diagnosis, treatment, and prevention of cell 
proliferative, autoimmune/inflammatory, and developmental disorders. 

BACKGROUND OF THE INVENTION 

Multicellular organisms are comprised of diverse cell types that differ dramatically both in 
10 structure and function. The identity of a cell is determined by its characteristic pattern of gene 
expression, and different cell types express overlapping but distinctive sets of genes throughout 
development. Spatial and temporal regulation of gene expression is critical for the control of cell 
proliferation, cell differentiation, apoptosis, and other processes that contribute to organismal 
development. Furthermore, gene expression is regulated in response to extracellular signals that 
15 mediate cell-cell communication and coordinate the activities of different cell types. Appropriate 
gene regulation also ensures that cells function efficiently by expressing only those genes whose 
functions are required at a given time. 
Transcription Factors 

Transcriptional regulatory proteins are essential for the control of gene expression. Some of 
20 these proteins function as transcription factors that initiate, activate, repress, or terminate gene 
transcription. Transcription factors generally bind to the promoter, enhancer, and upstream 
regulatory regions of a gene in a sequence-specific manner, although some factors bind regulatory 
elements within or downstream of a gene's coding region. Transcription factors may bind to a 
specific region of DNA singly or as a complex with other accessory factors. (Reviewed in Lewin, B. 
25 (1990) Genes IV . Oxford University Press, New York, NY, and Cell Press, Cambridge, MA, pp. 554- 
570.) 

The double helix structure and repeated sequences of DNA create topological and chemical 
features which can be recognized by transcription factors. These features are hydrogen bond donor 
and acceptor groups, hydrophobic patches, major and minor grooves, and regular, repeated stretches 
30 of sequence which induce distinct bends in the helix. Typically, transcription factors recognize 
specific DNA sequence motifs of about 20 nucleotides in length. Multiple, adjacent transcription 
factor-binding motifs may be required for gene regulation. 

Many transcription factors incorporate DNA-binding structural motifs which comprise either 
a helices or B sheets that bind to the major groove of DNA. Four well-characterized structural motifs 
35 are the helix -turn-helix, zinc finger, leucine zipper, and helix-loop-helix. Proteins containing these 
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motifs may act alone as monomers, or they may form homo- or heterodimers that interact with DNA. 

The helix-turn-helix motif consists of two a helices connected at a Fixed angle by a short 
chain of amino acids. One of the helices binds to the major groove. Helix-turn-helix motifs are 
exemplified by the homeobox motif which is present in homeodomain proteins. These proteins are 
5 critical for specifying the anterior-posterior body axis during development and are conserved 
throughout the animal kingdom. The Antennapedia and Ultrabithorax proteins of Drosophila 
melanogaster are prototypical homeodomain proteins (Pabo, CO. and R.T. Sauer (1992) Ann. Rev. 
Biochem. 61:1053-1095). 

The zinc finger motif, which binds zinc ions, generally contains tandem repeats of about 30 

10 amino acids consisting of periodically spaced cysteine and histidine residues. Examples of this 

sequence pattern include the C2H2-type, C4-type, and C3HC4-type ("RING" finger) zinc fingers, and 
the PHD domain (Lewin, supra ; Aasland, R. et al. (1995) Trends Biochem. Sci 20:56 - 59). Zinc 
finger proteins each contain an a helix and an antiparallel B sheet whose proximity and conformation 
are maintained by the zinc ion. Contact with DNA is made by the arginine preceding the a helix and 

15 by the second, third, and sixth residues of the a helix. The zinc finger motif may be repeated in a 
tandem array within a protein, such that the a helix of each zinc finger in the protein makes contact 
with the major groove of the DNA double helix. This repeated contact between the protein and the 
DNA produces a strong and specific DNA-protein interaction. The strength and specificity of the 
interaction can be regulated by the number of zinc finger motifs within the protein. 

20 The leucine zipper motif comprises a stretch of amino acids rich in leucine which can form 

an arnphipathic a helix. This structure provides the basis for dimerization of two leucine zipper 
proteins. The region adjacent to the leucine zipper is usually basic, and upon protein dimerization, is 
optimally positioned for binding to the major groove. Proteins containing such motifs are generally 
referred to as bZEP transcription factors. The leucine zipper motif is found in the proto-oncogenes 

25 Fos and Jun, which comprise the heterodimeric transcription factor API, involved in cell growth and 
the determination of cell lineage (Papavassiliou, A. G. (1995) N. Engl. J. Med. 332:45-47). 

The helix-loop-helix motif (HLH) consists of a short a helix connected by a loop to a longer 
a helix. The loop is flexible and allows the two helices to fold back against each other and to bind to 
DNA. The oncogene Myc, a transcription factor that activates genes required for cellular 

30 proliferation, contains a prototypical HLH motif. 

Most transcription factors contain characteristic DNA binding motifs, and variations on the 
above motifs and new motifs have been and are currently being characterized (Faisst, S. and S. Meyer 
(1992) Nucl. Acids Res. 20:3-26). These include the forkhead motif, found in transcription factors 
involved in development and oncogenesis (Hacker, U. et al. (1995) EMBO J. 14:5306-5317). 

35 Chromatin Associated Proteins 
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In the nucleus, DNA is packaged into chromatin, the compact organization of which limits 
the accessibility of DNA to transcription factors and plays a key role in gene regulation (Lewin, 
supra , pp. 409-410). The compact structure of chromatin is determined and influenced by chromatin- 
associated proteins such as the histones, the high mobility group (HMG) proteins, helicases, and the 

5 chromodomain proteins. There are five classes of histones, HI, H2A, H2B, H3, and H4, all of which 
are highly basic, low molecular weight proteins. The fundamental unit of chromatin, the nucleosome, 
consists of 200 base pairs of DNA associated with two copies each of H2A, H2B, H3, and H4. HI 
links adjacent nucleosomes. HMG proteins are low molecular weight, non-histone proteins that may 
play a role in unwinding DNA and stabilizing single-stranded DNA. Helicases, which are DNA- 

10 dependent ATPases, unwind DNA, allowing access for transcription factors. Chromodomain proteins 
play a key role in the formation of highly compacted heterochromatin, which is transcriptionally 
silent. 

Diseases and disorders related to gene reflation 

Many neoplastic disorders in humans can be attributed to inappropriate gene expression. 

15 Malignant cell growth may result from either excessive expression of tumor promoting genes or 
insufficient expression of tumor suppressor genes (Cleary, M.L. (1992) Cancer Surv. 15:89-104). 
The zinc finger-type transcriptional regulator WT1 is a tumor-suppressor protein that is inactivated in 
children with Wilm's tumor. The oncogene bcl-6, which plays an important role in large-cell 
lymphoma, is also a zinc-finger protein (Papavassiliou, supra) . Chromosomal translocations may 

20 also produce chimeric loci which fuse the coding sequence of a transcriptional regulator with the 

regulatory regions of a second unrelated gene. In Burkitt's lymphoma, for example, the transcription 
factor Myc is translocated to the immunoglobulin heavy chain locus, greatly enhancing Myc 
expression and resulting in rapid cell growth leading to leukemia (Latchman, D. S. (1996) N. Engl. J. 
Med. 334:28-33). 

25 In addition, the immune system responds to infection or trauma by activating a cascade of 

events that coordinate the progressive selection, amplification, and mobilization of cellular defense 
mechanisms. A complex and balanced program of gene activation and repression is involved in this 
process. However, hyperactivity of the immune system as a result of improper or insufficient 
regulation of gene expression may result in considerable tissue or organ damage. This damage is 

30 well documented in immunological responses associated with arthritis, allergens, heart attack, stroke, 
and infections (Isselbacher et al. Harrison's Principles of Internal Medicine , 13/e, McGraw Hill, Inc. 
and Teton Data Systems Software, 1996). The causative gene for autoimmune polyendocrinopathy- 
candidiasis-ectodermal dystrophy (APECED) was recently isolated and found to encode a protein 
with two PHD-type zinc finger motifs (Bjorses, P. et al. (1998) Hum. Mol. Genet. 7: 1547-1553). 

35 Furthermore, the generation of multicellular organisms is based upon the induction and 
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coordination of cell differentiation at the appropriate stages of development. Central to this process 
is differential gene expression, which confers the distinct identities of cells and tissues throughout the 
body. Failure to regulate gene expression during development can result in developmental disorders. 
Human developmental disorders caused by mutations in zinc finger-type transcriptional regulators 
5 include: urogenital developmental abnormalities associated with WT1; Greig 

cephalopolysyndactyly, Pallister-Hall syndrome, and postaxial Polydactyly type A (GLI3), and 
Townes-Brocks syndrome, characterized by anal, renal, limb, and ear abnormalities (SALL1) 
(Engelkamp, D. and V. van Heyningen (1996) Curr. Opin. Genet. Dev. 6:334-342; Kohlhase, J. et al. 
(1999) Am. J. Hum. Genet. 64:435-445). 
10 The discovery of new human transcriptional regulator proteins and the polynucleotides 

encoding them satisfies a need in the art by providing new compositions which are useful in the 
diagnosis, prevention, and treatment of cell proliferative, autoimmune/inflammatory, and 
developmental disorders. 

15 SUMMARY OF THE INVENTION 

The invention features purified polypeptides, human transcriptional regulator proteins, 
referred to collectively as "TXREG" and individually as "TXREG-1," "TXREG-2," "TXREG-3," 
"TXREG-4," "TXREG-5," "TXREG-6," "TXREG-7," "TXREG-8," "TXREG-9," "TXREG-10," 
"TXREG-1 1," "TXREG-12," "TXREG-13," "TXREG-14," "TXREG-15," "TXREG-16," "TXREG- 

20 17," "TXREG-18," "TXREG-19," "TXREG-20," "TXREG-21," "TXREG-22," "TXREG-23," 

"TXREG-24," "TXREG-25," "TXREG-26," "TXREG-27," "TXREG-28," "TXREG-29," "TXREG- 
30," "TXREG-31," and "TXREG-32." In one aspect, the invention provides an isolated polypeptide 
comprising an amino acid sequence selected from the group consisting of a) an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-32, b) a naturally occurring amino acid sequence 

25 having at least 90% sequence identity to an amino acid sequence selected from the group consisting 
of SEQ ID NO: 1-32, c) a biologically active fragment of an amino acid sequence selected from the 
group consisting of SEQ ED NO: 1-32, and d) an immunogenic fragment of an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-32. In one alternative, the invention provides an 
isolated polypeptide comprising the amino acid sequence of SEQ ID NO: 1-32. 

30 The invention further provides an isolated polynucleotide encoding a polypeptide comprising 

an amino acid sequence selected from the group consisting of a) an amino acid sequence selected 
from the group consisting of SEQ ID NO: 1-32, b) a naturally occurring amino acid sequence having 
at least 90% sequence identity to an amino acid sequence selected from the group consisting of SEQ 
ID NO: 1-32, c) a biologically active fragment of an amino acid sequence selected from the group 

35 consisting of SEQ ID NO: 1-32, and d) an immunogenic fragment of an amino acid sequence selected 
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from the group consisting of SEQ ID NO: 1-32. In one alternative, the polynucleotide encodes a 
polypeptide selected from the group consisting of SEQ ID NO: 1-32. In another alternative, the 
polynucleotide is selected from the group consisting of SEQ ED NO:33-64. 

Additionally, the invention provides a recombinant polynucleotide comprising a promoter 
5 sequence operably linked to a polynucleotide encoding a polypeptide comprising an amino acid 
sequence selected from the group consisting of a) an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-32, b) a naturally occurring amino acid sequence having at least 90% 
sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NO: 1-32, 
c) a biologically active fragment of an amino acid sequence selected from the group consisting of 
10 SEQ ID NO: 1-32, and d) an immunogenic fragment of an amino acid sequence selected from the 
group consisting of SEQ ID NO: 1-32. In one alternative, the invention provides a cell transformed 
with the recombinant polynucleotide. In another alternative, the invention provides a transgenic 
organism comprising the recombinant polynucleotide. 

The invention also provides a method for producing a polypeptide comprising an amino acid 
15 sequence selected from the group consisting of a) an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-32, b) a naturally occurring amino acid sequence having at least 90% 
sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NO: 1-32, 
c) a biologically active fragment of an amino acid sequence selected from the group consisting of 
SEQ ID NO: 1-32, and d) an immunogenic fragment of an amino acid sequence selected from the 
20 group consisting of SEQ ID NO: 1-32. The method comprises a) culturing a cell under conditions 
suitable for expression of the polypeptide, wherein said cell is transformed with a recombinant 
polynucleotide comprising a promoter sequence operably linked to a polynucleotide encoding the 
polypeptide, and b) recovering the polypeptide so expressed. 

Additionally, the invention provides an isolated antibody which specifically binds to a 
25 polypeptide comprising an amino acid sequence selected from the group consisting of a) an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-32, b) a naturally occurring amino 
acid sequence having at least 90% sequence identity to an amino acid sequence selected from the 
group consisting of SEQ ID NO: 1-32, c) a biologically active fragment of an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-32, and d) an immunogenic fragment of an amino 
30 acid sequence selected from the group consisting of SEQ ID NO: 1-32. 

The invention further provides an isolated polynucleotide comprising a polynucleotide 
sequence selected from the group consisting of a) a polynucleotide sequence selected from the group 
consisting of SEQ ID NO:33-64, b) a naturally occurring polynucleotide sequence having at least 
70% sequence identity to a polynucleotide sequence selected from the group consisting of SEQ ID 
35 NO:33-64, c) a polynucleotide sequence complementary to a), d) a polynucleotide sequence 

5 
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complementary to b), and e) an RNA equivalent of a)-d). In one alternative, the polynucleotide 
comprises at least 60 contiguous nucleotides. 

Additionally, the invention provides a method for detecting a target polynucleotide in a 
sample, said target polynucleotide having a sequence of a polynucleotide comprising a polynucleotide 
5 sequence selected from the group consisting of a) a polynucleotide sequence selected from the group 
consisting of SEQ ID NO:33-64, b) a naturally occurring polynucleotide sequence having at least 
70% sequence identity to a polynucleotide sequence selected from the group consisting of SEQ ID 
NO:33-64, c) a polynucleotide sequence complementary to a), d) a polynucleotide sequence 
complementary to b), and e) an RNA equivalent of a)-d). The method comprises a) hybridizing the 

10 sample with a probe comprising at least 20 contiguous nucleotides comprising a sequence 

complementary to said target polynucleotide in the sample, and which probe specifically hybridizes to 
said target polynucleotide, under conditions whereby a hybridization complex is formed between said 
probe and said target polynucleotide or fragments thereof, and b) detecting the presence or absence of 
said hybridization complex, and optionally, if present, the amount thereof. In one alternative, the 

15 probe comprises at least 60 contiguous nucleotides. 

The invention further provides a method for detecting a target polynucleotide in a sample, 
said target polynucleotide having a sequence of a polynucleotide comprising a polynucleotide 
sequence selected from the group consisting of a) a polynucleotide sequence selected from the group 
consisting of SEQ ID NO:33-64, b) a naturally occurring polynucleotide sequence having at least 

20 70% sequence identity to a polynucleotide sequence selected from the group consisting of SEQ ID 
NO:33-64, c) a polynucleotide sequence complementary to a), d) a polynucleotide sequence 
complementary to b), and e) an RNA equivalent of a)-d). The method comprises a) amplifying said 
target polynucleotide or fragment thereof using polymerase chain reaction amplification, and b) 
detecting the presence or absence of said amplified target polynucleotide or fragment thereof, and, 

25 optionally, if present, the amount thereof. 

The invention further provides a pharmaceutical composition comprising an effective amount 
of a polypeptide comprising an amino acid sequence selected from the group consisting of a) an 
amino acid sequence selected from the group consisting of SEQ ID NO: 1-32, b) a naturally occurring 
amino acid sequence having at least 90% sequence identity to an amino acid sequence selected from 

30 the group consisting of SEQ ID NO: 1-32, c) a biologically active fragment of an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-32, and d) an immunogenic fragment of an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-32, and a pharmaceutically 
acceptable excipient. In one embodiment, the pharmaceutical composition comprises an amino acid 
sequence selected from the group consisting of SEQ ID NO: 1-32. The invention additionally 

35 provides a method of treating a disease or condition associated with decreased expression of 
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functional TXREG, comprising administering to a patient in need of such treatment the 
pharmaceutical composition. 

The invention also provides a method for screening a compound for effectiveness as an 
agonist of a polypeptide comprising an amino acid sequence selected from the group consisting of a) 

5 an amino acid sequence selected from the group consisting of SEQ ID NO: 1-32, b) a naturally 
occurring amino acid sequence having at least 90% sequence identity to an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-32, c) a biologically active fragment of an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-32, and d) an immunogenic 
fragment of an amino acid sequence selected from the group consisting of SEQ ID NO: 1-32. The 

10 method comprises a) exposing a sample comprising the polypeptide to a compound, and b) detecting 
agonist activity in the sample. In one alternative, the invention provides a pharmaceutical 
composition comprising an agonist compound identified by the method and a pharmaceutically 
acceptable excipient. In another alternative, the invention provides a method of treating a disease or 
condition associated with decreased expression of functional TXREG, comprising administering to a 

15 patient in need of such treatment the pharmaceutical composition. 

Additionally, the invention provides a method for screening a compound for effectiveness as 
an antagonist of a polypeptide comprising an amino acid sequence selected from the group consisting 
of a) an amino acid sequence selected from the group consisting of SEQ ID NO: 1-32, b) a naturally 
occurring amino acid sequence having at least 90% sequence identity to an amino acid sequence 

20 selected from the group consisting of SEQ ID NO: 1-32, c) a biologically active fragment of an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-32, and d) an immunogenic 
fragment of an amino acid sequence selected from the group consisting of SEQ ID NO: 1-32. The 
method comprises a) exposing a sample comprising the polypeptide to a compound, and b) detecting 
antagonist activity in the sample. In one alternative, the invention provides a pharmaceutical 

25 composition comprising an antagonist compound identified by the method and a pharmaceutically 
acceptable excipient. In another alternative, the invention provides a method of treating a disease or 
condition associated with overexpression of functional TXREG, comprising administering to a 
patient in need of such treatment the pharmaceutical composition. 

The invention further provides a method of screening for a compound that specifically binds 

30 to a polypeptide comprising an amino acid sequence selected from the group consisting of a) an 

amino acid sequence selected from the group consisting of SEQ ID NO: 1-32, b) a naturally occurring 
amino acid sequence having at least 90% sequence identity to an amino acid sequence selected from 
the group consisting of SEQ ED NO: 1-32, c) a biologically active fragment of an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-32, and d) an immunogenic fragment of an amino 

35 acid sequence selected from the group consisting of SEQ ID NO: 1-32. The method comprises a) 
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combining the polypeptide with at least one test compound under suitable conditions, and b) 
detecting binding of the polypeptide to the test compound, thereby identifying a compound that 
specifically binds to the polypeptide. 

The invention further provides a method of screening for a compound that modulates the 
5 activity of a polypeptide comprising an amino acid sequence selected from the group consisting of a) 
an amino acid sequence selected from the group consisting of SEQ ID NO: 1-32, b) a naturally 
occurring amino acid sequence having at least 90% sequence identify to an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-32, c) a biologically active fragment of an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-32, and d) an immunogenic 

10 fragment of an amino acid sequence selected from the group consisting of SEQ ID NO: 1-32. The 
method comprises a) combining the polypeptide with at least one test compound under conditions 
permissive for the activity of the polypeptide, b) assessing the activity of the polypeptide in the 
presence of the test compound, and c) comparing the activity of the polypeptide in the presence of the 
test compound with the activity of the polypeptide in the absence of the test compound, wherein a 

15 change in the activity of the polypeptide in the presence of the test compound is indicative of a 
compound that modulates the activity of the polypeptide. 

The invention further provides a method for screening a compound for effectiveness in 
altering expression of a target polynucleotide, wherein said target polynucleotide comprises a 
sequence selected from the group consisting of SEQ ID NO:33-64, the method comprising a) 

20 exposing a sample comprising the target polynucleotide to a compound, and b) detecting altered 
expression of the target polynucleotide. 



BRIEF DESCRIPTION OF THE TABLES 

Table 1 shows polypeptide and nucleotide sequence identification numbers (SEQ ID NOs), 
25 clone identification numbers (clone IDs), cDNA libraries, and cDNA fragments used to assemble full- 
length sequences encoding TXREG. 

Table 2 shows features of each polypeptide sequence, including potential motifs, homologous 
sequences, and methods, algorithms, and searchable databases used for analysis of TXREG. 
Table 3 shows selected fragments of each nucleic acid sequence; the tissue-specific 
30 expression patterns of each nucleic acid sequence as determined by northern analysis; diseases, 
disorders, or conditions associated with these tissues; and the vector into which each cDNA was 
cloned. 

Table 4 describes the tissues used to construct the cDNA libraries from which cDNA clones 
encoding TXREG were isolated. 
35 Table 5 shows the tools, programs, and algorithms used to analyze the polynucleotides and 
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polypeptides of the invention, along with applicable descriptions, references, and threshold 
parameters. 

DESCRIPTION OF THE INVENTION 

5 Before the present proteins, nucleotide sequences, and methods are described, it is understood 

that this invention is not limited to the particular machines, materials and methods described, as these 
may vary. It is also to be understood that the terminology used herein is for the purpose of describing 
particular embodiments only, and is not intended to limit the scope of the present invention which 
will be limited only by the appended claims. 

10 It must be noted that as used herein and in the appended claims, the singular forms "a," "an," 

and "the" include plural reference unless the context clearly dictates otherwise. Thus, for example, a 
reference to "a host cell" includes a plurality of such host cells, and a reference to *'an antibody" is a 
reference to one or more antibodies and equivalents thereof known to those skilled in the art, and so 
forth. 

15 Unless defined otherwise, all technical and scientific terms used herein have the same 

meanings as commonly understood by one of ordinary skill in the art to which this invention belongs. 
Although any machines, materials, and methods similar or equivalent to those described herein can be 
used to practice or test the present invention, the preferred machines, materials and methods are now 
described. All publications mentioned herein are cited for the purpose of describing and disclosing 

20 the cell lines, protocols, reagents and vectors which are reported in the publications and which might 
be used in connection with the invention. Nothing herein is to be construed as an admission that the 
invention is not entitled to antedate such disclosure by virtue of prior invention. 
DEFINITIONS 

"TXREG" refers to the amino acid sequences of substantially purified TXREG obtained from 
25 any species, particularly a mammalian species, including bovine, ovine, porcine, murine, equine, and 
human, and from any source, whether natural, synthetic, semi-synthetic, or recombinant. 

The term "agonist" refers to a molecule which intensifies or mimics the biological activity of 
TXREG. Agonists may include proteins, nucleic acids, carbohydrates, small molecules, or any other 
compound or composition which modulates the activity of TXREG either by directly interacting with 
30 TXREG or by acting on components of the biological pathway in which TXREG participates. 

An "allelic variant" is an alternative form of the gene encoding TXREG. Allelic variants may 
result from at least one mutation in the nucleic acid sequence and may result in altered mRNAs or in 
polypeptides whose structure or function may or may not be altered. A gene may have none, one, or 
many allelic variants of its naturally occurring form. Common mutational changes which give rise to 
35 allelic variants are generally ascribed to natural deletions, additions, or substitutions of nucleotides. 

9 
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Each of these types of changes may occur alone, or in combination with the others, one or more times 
in a given sequence. 

"Altered" nucleic acid sequences encoding TXREG include those sequences with deletions, 
insertions, or substitutions of different nucleotides, resulting in a polypeptide the same as TXREG or 
5 a polypeptide with at least one functional characteristic of TXREG. Included within this definition 
are polymorphisms which may or may not be readily detectable using a particular oligonucleotide 
probe of the polynucleotide encoding TXREG, and improper or unexpected hybridization to allelic 
variants, with a locus other than the normal chromosomal locus for the polynucleotide sequence 
encoding TXREG. The encoded protein may also be "altered," and may contain deletions, insertions, 

10 or substitutions of amino acid residues which produce a silent change and result in a functionally 
equivalent TXREG. Deliberate amino acid substitutions may be made on the basis of similarity in 
polarity, charge, solubility, hydrophobicity, hydrophilicity, and/or the amphipathic nature of the 
residues, as long as the biological or immunological activity of TXREG is retained. For example, 
negatively charged amino acids may include aspartic acid and glutamic acid, and positively charged 

15 amino acids may include lysine and arginine. Amino acids with uncharged polar side chains having 
similar hydrophilicity values may include: asparagine and glutamine; and serine and threonine. 
Amino acids with uncharged side chains having similar hydrophilicity values may include: leucine, 
isoleucine, and valine; glycine and alanine; and phenylalanine and tyrosine. 

The terms "amino acid" and "amino acid sequence" refer to an oligopeptide, peptide, 

20 polypeptide, or protein sequence, or a fragment of any of these, and to naturally occurring or synthetic 
molecules. Where "amino acid sequence" is recited to refer to a sequence of a naturally occurring 
protein molecule, "amino acid sequence" and like terms are not meant to limit the amino acid 
sequence to the complete native amino acid sequence associated with the recited protein molecule. 
"Amplification" relates to the production of additional copies of a nucleic acid sequence. 

25 Amplification is generally carried out using polymerase chain reaction (PCR) technologies well 
known in the art. 

The term "antagonist" refers to a molecule which inhibits or attenuates the biological activity 
of TXREG. Antagonists may include proteins such as antibodies, nucleic acids, carbohydrates, small 
molecules, or any other compound or composition which modulates the activity of TXREG either by 
30 directly interacting with TXREG or by acting on components of the biological pathway in which 
TXREG participates. 

The term "antibody" refers to intact immunoglobulin molecules as well as to fragments 
thereof, such as Fab, F(ab') 2 > and Fv fragments, which are capable of binding an epitopic determinant. 
Antibodies that bind TXREG polypeptides can be prepared using intact polypeptides or using 
35 fragments containing small peptides of interest as the immunizing antigen. The polypeptide or 
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oligopeptide used to immunize an animal (e.g., a mouse, a rat, or a rabbit) can be derived from the 
translation of RNA, or synthesized chemically, and can be conjugated to a carrier protein if desired. 
Commonly used carriers that are chemically coupled to peptides include bovine serum albumin, 
thyroglobulin, and keyhole limpet hemocyanin (KLH). The coupled peptide is then used to immunize 
5 the animal. 

The term "antigenic determinant" refers to that region of a molecule (i.e., an epitope) that 
makes contact with a particular antibody. When a protein or a fragment of a protein is used to 
immunize a host animal, numerous regions of the protein may induce the production of antibodies 
which bind specifically to antigenic determinants (particular regions or three-dimensional structures 

10 on the protein). An antigenic determinant may compete with the intact antigen (i.e., the immunogen 
used to elicit the immune response) for binding to an antibody. 

The term "antisense" refers to any composition capable of base-pairing with the "sense" 
(coding) strand of a specific nucleic acid sequence. Antisense compositions may include DNA; 
RNA; peptide nucleic acid (PNA); oligonucleotides having modified backbone linkages such as 

15 phosphorothioates, methylphosphonates, or benzylphosphonates; oligonucleotides having modified 
sugar groups such as 2'-methoxyethyl sugars or 2-methoxyethoxy sugars; or oligonucleotides having 
modified bases such as 5-methyl cytosine, 2'-deoxyuracil, or 7-deaza-2'-deoxyguanosine. Antisense 
molecules may be produced by any method including chemical synthesis or transcription. Once 
introduced into a cell, the complementary antisense molecule base-pairs with a naturally occurring 

20 nucleic acid sequence produced by the cell to form duplexes which block either transcription or 
translation. The designation "negative" or "minus" can refer to the antisense strand, and the 
designation "positive" or "plus" can refer to the sense strand of a reference DNA molecule. 

The term "biologically active" refers to a protein having structural, regulatory, or biochemical 
functions of a naturally occurring molecule. Likewise, "immunologically active" or "immunogenic" 

25 refers to the capability of the natural, recombinant, or synthetic TXREG, or of any oligopeptide 

thereof, to induce a specific immune response in appropriate animals or cells and to bind with specific 
antibodies. 

"Complementary" describes the relationship between two single-stranded nucleic acid 
sequences that anneal by base-pairing. For example, 5'-AGT-3' pairs with its complement, 
30 3'-TCA-5\ 

A "composition comprising a given polynucleotide sequence" and a "composition comprising 
a given amino acid sequence" refer broadly to any composition containing the given polynucleotide 
or amino acid sequence. The composition may comprise a dry formulation or an aqueous solution. 
Compositions comprising polynucleotide sequences encoding TXREG or fragments of TXREG may 
35 be employed as hybridization probes. The probes may be stored in freeze-dried form and may be 
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associated with a stabilizing agent such as a carbohydrate. In hybridizations, the probe may be 
deployed in an aqueous solution containing salts (e.g., NaCI), detergents (e.g., sodium dodecyl 
sulfate; SDS), and other components (e.g., Denhardt's solution, dry milk, salmon sperm DNA, etc.). 

"Consensus sequence" refers to a nucleic acid sequence which has been subjected to repeated 
5 DNA sequence analysis to resolve uncalled bases, extended using the XL-PCR kit (PE Biosystems, 
Foster City CA) in the 5' and/or the 3' direction, and resequenced, or which has been assembled from 
one or more overlapping cDNA, EST, or genomic DNA fragments using a computer program for 
fragment assembly, such as the GEL VIEW fragment assembly system (GCG, Madison WI) or Phrap 
(University of Washington, Seattle WA). Some sequences have been both extended and assembled to 
10 produce the consensus sequence. 

"Conservative amino acid substitutions" are those substitutions that are predicted to least 
interfere with the properties of the original protein, i.e., the structure and especially the function of 
the protein is conserved and not significantly changed by such substitutions. The table below shows 
amino acids which may be substituted for an original amino acid in a protein and which are regarded 



15 


as conservative amino acid substitutions. 

Original Residue 


Conservative Substitution 




Ala 


Gly, Ser 




Arg 


His, Lys 




Asn 


Asp, Gin, His 


20 


Asp 


Asn, Glu 




Cys 


Ala, Ser 




Gin 


Asn, Glu, His 




Glu 


Asp, Gin, His 




Gly 


Ala 


25 


His 


Asn, Arg, Gin, Glu 




He 


Leu, Val 




Leu 


He, Val 




Lys 


Arg, Gin, Glu 




Met 


Leu, He 


30 


Phe 


His, Met, Leu, Trp, Tyr 




Ser 


Cys, Thr 




Thr 


Ser, Val 




Trp 


Phe, Tyr 




Tyr 


His, Phe, Trp 


35 


Val 


He, Leu, Thr 



Conservative amino acid substitutions generally maintain (a) the structure of the polypeptide 
backbone in the area of the substitution, for example, as a beta sheet or alpha helical conformation, 
(b) the charge or hydrophobicity of the molecule at the site of the substitution, and/or (c) the bulk of 
40 the side chain. 

A "deletion" refers to a change in the amino acid or nucleotide sequence that results in the 
absence of one or more amino acid residues or nucleotides. 
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The term "derivative" refers to a chemically modified polynucleotide or polypeptide. 
Chemical modifications of a polynucleotide sequence can include, for example, replacement of 
hydrogen by an alkyl, acyl, hydroxyl, or amino group. A derivative polynucleotide encodes a 
polypeptide which retains at least one biological or immunological function of the natural molecule. 
5 A derivative polypeptide is one modified by glycosylation, pegylation, or any similar process that 
retains at least one biological or immunological function of the polypeptide from which it was 
derived. 

A "detectable label" refers to a reporter molecule or enzyme that is capable of generating a 
measurable signal and is covalently or noncovalently joined to a polynucleotide or polypeptide. 

10 A "fragment" is a unique portion of TXREG or the polynucleotide encoding TXREG which is 

identical in sequence to but shorter in length than the parent sequence. A fragment may comprise up 
to the entire length of the defined sequence, minus one nucleotide/amino acid residue. For example, 
a fragment may comprise from 5 to 1000 contiguous nucleotides or amino acid residues. A fragment 
used as a probe, primer, antigen, therapeutic molecule, or for other purposes, may be at least 5, 10, 

15 15, 16, 20, 25, 30, 40, 50, 60, 75, 100, 150, 250 or at least 500 contiguous nucleotides or amino acid 
residues in length. Fragments may be preferentially selected from certain regions of a molecule. For 
example, a polypeptide fragment may comprise a certain length of contiguous amino acids selected 
from the first 250 or 500 amino acids (or first 25% or 50% of a polypeptide) as shown in a certain 
defined sequence. Clearly these lengths are exemplary, and any length that is supported by the 

20 specification, including the Sequence Listing, tables, and figures, may be encompassed by the present 
embodiments. 

A fragment of SEQ ID NO:33-64 comprises a region of unique polynucleotide sequence that 
specifically identifies SEQ ID NO:33-64, for example, as distinct from any other sequence in the 
genome from which the fragment was obtained. A fragment of SEQ ID NO:33-64 is useful, for 
25 example, in hybridization and amplification technologies and in analogous methods that distinguish 
SEQ ID NO:33-64 from related polynucleotide sequences. The precise length of a fragment of SEQ 
ID NO:33-64 and the region of SEQ ID NO:33-64 to which the fragment corresponds are routinely 
determinable by one of ordinary skill in the art based on the intended purpose for the fragment. 

A fragment of SEQ ID NO: 1-32 is encoded by a fragment of SEQ ID NO:33-64. A fragment 
30 of SEQ ID NO: 1-32 comprises a region of unique amino acid sequence that specifically identifies 

SEQ ID NO: 1-32. For example, a fragment of SEQ ID NO: 1-32 is useful as an immunogenic peptide 
for the development of antibodies that specifically recognize SEQ ID NO: 1-32. The precise length of 
a fragment of SEQ ID NO: 1-32 and the region of SEQ ID NO: 1-32 to which the fragment 
. corresponds are routinely determinable by one of ordinary skill in the art based on the intended 
35 purpose for the fragment. 
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A "full-length" polynucleotide sequence is one containing at least a translation initiation 
codon (e.g., methionine) followed by an open reading frame and a translation termination codon. A 
"full-length" polynucleotide sequence encodes a "full-length" polypeptide sequence. 

"Homology" refers to sequence similarity or, interchangeably, sequence identity, between 
5 two or more polynucleotide sequences or two or more polypeptide sequences. 

The terms "percent identity" and "% identity," as applied to polynucleotide sequences, refer 
to the percentage of residue matches between at least two polynucleotide sequences aligned using a 
standardized algorithm. Such an algorithm may insert, in a standardized and reproducible way, gaps 
in the sequences being compared in order to optimize alignment between two sequences, and 

10 therefore achieve a more meaningful comparison of the two sequences. 

Percent identity between polynucleotide sequences may be determined using the default 
parameters of the CLUSTAL V algorithm as incorporated into the MEGALIGN version 3.12e 
sequence alignment program. This program is part of the LASERGENE software package, a suite of 
molecular biological analysis programs (DNASTAR, Madison WI). CLUSTAL V is described in 

15 Higgins, D.G. and P.M. Sharp (1989) CABIOS 5:151-153 and in Higgins, D.G. et al. (1992) CABIOS 
8:189-191. For pairwise alignments of polynucleotide sequences, the default parameters are set as 
follows: Ktuple=2, gap penalty=5, window=4, and "diagonals saved"=4. The "weighted" residue 
weight table is selected as the default. Percent identity is reported by CLUSTAL V as the "percent 
similarity" between aligned polynucleotide sequences. 

20 Alternatively, a suite of commonly used and freely available sequence comparison algorithms 

is provided by the National Center for Biotechnology Information (NCBI) Basic Local Alignment 
Search Tool (BLAST) (Altschul, S.F. et al. (1990) J. Mol. Biol. 215:403-410), which is available 
from several sources, including the NCBI, Bethesda, MD, and on the Internet at 
http://www.ncbi.nlm.nih.gov/BLAST/. The BLAST software suite includes various sequence 

25 analysis programs including "blastn," that is used to align a known polynucleotide sequence with 

other polynucleotide sequences from a variety of databases. Also available is a tool called "BLAST 2 
Sequences" that is used for direct pairwise comparison of two nucleotide sequences. "BLAST 2 
Sequences" can be accessed and used interactively at http://www.ncbi.nlm.nih.gov/gorf/bl2.html. 
The "BLAST 2 Sequences" tool can be used for both blastn and blastp (discussed below). BLAST 

30 programs are commonly used with gap and other parameters set to default settings. For example, to 
compare two nucleotide sequences, one may use blastn with the "BLAST 2 Sequences" tool Version 
2.0.12 (April-2 1-2000) set at default parameters. Such default parameters may be, for example: 
Matrix: BLOSUM62 
Reward for match: 1 

35 Penalty for mismatch: -2 
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Open Gap: 5 and Extension Gap: 2 penalties 
Gap x drop-off: 50 
Expect: 10 
Word Size: 11 
5 Filter: on 

Percent identity may be measured over the length of an entire defined sequence, for example, 
as defined by a particular SEQ ID number, or may be measured over a shorter length, for example, 
over the length of a fragment taken from a larger, defined sequence, for instance, a fragment of at 
least 20, at least 30, at least 40, at least 50, at least 70, at least 100, or at least 200 contiguous 

10 nucleotides. Such lengths are exemplary only, and it is understood that any fragment length 

supported by the sequences shown herein, in the tables, figures, or Sequence Listing, may be used to 
describe a length over which percentage identity may be measured. 

Nucleic acid sequences that do not show a high degree of identity may nevertheless encode 
similar amino acid sequences due to the degeneracy of the genetic code. It is understood that changes 

15 in a nucleic acid sequence can be made using this degeneracy to produce multiple nucleic acid 
sequences that all encode substantially the same protein. 

The phrases "percent identity" and "% identity," as applied to polypeptide sequences, refer to 
the percentage of residue matches between at least two polypeptide sequences aligned using a 
standardized algorithm. Methods of polypeptide sequence alignment are well-known. Some 

20 alignment methods take into account conservative amino acid substitutions. Such conservative 

substitutions, explained in more detail above, generally preserve the charge and hydrophobicity at the 
site of substitution, thus preserving the structure (and therefore function) of the polypeptide. 

Percent identity between polypeptide sequences may be determined using the default 
parameters of the CLUSTAL V algorithm as incorporated into the MEG ALIGN version 3.12e 

25 sequence alignment program (described and referenced above). For pairwise alignments of 

polypeptide sequences using CLUSTAL V, the default parameters are set as follows: Ktuple=l, gap 
penalty=3, window=5, and "diagonals saved"=5. The PAM250 matrix is selected as the default 
residue weight table. As with polynucleotide alignments, the percent identity is reported by 
CLUSTAL V as the "percent similarity" between aligned polypeptide sequence pairs. 

30 Alternatively the NCBI BLAST software suite may be used. For example, for a pairwise 

comparison of two polypeptide sequences, one may use the "BLAST 2 Sequences" tool Version 
2.0.12 (Apr-2 1-2000) with blastp set at default parameters. Such default parameters may be, for 
example: 

Matrix: BLOSUM62 

35 Open Gap: 1 1 and Extension Gap: 1 penalties 
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Gap x drop-off: 50 
Expect: 10 
Word Size: 3 
Filter: on 

5 Percent identity may be measured over the length of an entire defined polypeptide sequence, 

for example, as defined by a particular SEQ ID number, or may be measured over a shorter length, for 
example, over the length of a fragment taken from a larger, defined polypeptide sequence, for 
instance, a fragment of at least 15, at least 20, at least 30, at least 40, at least 50, at least 70 or at least 
150 contiguous residues. Such lengths are exemplary only, and it is understood that any fragment 

10 length supported by the sequences shown herein, in the tables, figures or Sequence Listing, may be 
used to describe a length over which percentage identity may be measured. 

"Human artificial chromosomes" (HACs) are linear microchromosomes which may contain 
DNA sequences of about 6 kb to 10 Mb in size, and which contain all of the elements required for 
chromosome replication, segregation and maintenance. 

15 The term "humanized antibody" refers to an antibody molecule in which the amino acid 

sequence in the non-antigen binding regions has been altered so that the antibody more closely 
resembles a human antibody, and still retains its original binding ability. 

"Hybridization" refers to the process by which a polynucleotide strand anneals with a 
complementary strand through base pairing under defined hybridization conditions. Specific 

20 hybridization is an indication that two nucleic acid sequences share a high degree of complementarity. 
Specific hybridization complexes form under permissive annealing conditions and remain hybridized 
after the "washing" step(s). The washing step(s) is particularly important in determining the 
stringency of the hybridization process, with more stringent conditions allowing less non-specific 
binding, i.e., binding between pairs of nucleic acid strands that are not perfectly matched. Permissive 

25 conditions for annealing of nucleic acid sequences are routinely determinable by one of ordinary skill 
in the art and may be consistent among hybridization experiments, whereas wash conditions may be 
varied among experiments to achieve the desired stringency, and therefore hybridization specificity. 
Permissive annealing conditions occur, for example, at 68°C in the presence of about 6 x SSC, about 
1% (w/v) SDS, and about 100 jig/ml sheared, denatured salmon sperm DNA. 

30 Generally, stringency of hybridization is expressed, in part, with reference to the temperature 

under which the wash step is carried out. Such wash temperatures are typically selected to be about 
5°C to 20°C lower than the thermal melting point (T m ) for the specific sequence at a defined ionic 
strength and pH. The T m is the temperature (under defined ionic strength and pH) at which 50% of 
the target sequence hybridizes to a perfectly matched probe. An equation for calculating T m and 

35 conditions for nucleic acid hybridization are well known and can be found in Sambrook, J. et al., 
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1989, Molecular Cloning: A Laboratory Manual s 2 nd ed., vol. 1-3, Cold Spring Harbor Press, 
Plainview NY; specifically see volume 2, chapter 9. 

High stringency conditions for hybridization between polynucleotides of the present 
invention include wash conditions of 68°C in the presence of about 0.2 x SSC and about 0.1% SDS, 

5 for 1 hour. Alternatively, temperatures of about 65°C, 60°C, 55°C, or 42°C may be used. SSC 
concentration may be varied from about 0.1 to 2 x SSC, with SDS being present at about 0.1%. 
Typically, blocking reagents are used to block non-specific hybridization. Such blocking reagents 
include, for instance, sheared and denatured salmon sperm DNA at about 100-200 Mg/ml. Organic 
solvent, such as formamide at a concentration of about 35-50% v/v, may also be used under particular 

10 circumstances, such as for RNA:DNA hybridizations. Useful variations on these wash conditions 
will be readily apparent to those of ordinary skill in the art. Hybridization, particularly under high 
stringency conditions, may be suggestive of evolutionary similarity between the nucleotides. Such 
similarity is strongly indicative of a similar role for the nucleotides and their encoded polypeptides. 
The term "hybridization complex" refers to a complex formed between two nucleic acid 

15 sequences by virtue of the formation of hydrogen bonds between complementary bases. A 

hybridization complex may be formed in solution (e.g., C 0 t or R^t analysis) or formed between one 
nucleic acid sequence present in solution and another nucleic acid sequence immobilized on a solid 
support (e.g., paper, membranes, filters, chips, pins or glass slides, or any other appropriate substrate 
to which cells or their nucleic acids have been fixed). 

20 The words "insertion" and "addition" refer to changes in an amino acid or nucleotide 

sequence resulting in the addition of one or more amino acid residues or nucleotides, respectively. 

"Immune response" can refer to conditions associated with inflammation, trauma, immune 
disorders, or infectious or genetic disease, etc. These conditions can be characterized by expression 
of various factors, e.g., cytokines, chemokines, and other signaling molecules, which may affect 

25 cellular and systemic defense systems. 

An "immunogenic fragment" is a polypeptide or oligopeptide fragment of TXREG which is 
capable of eliciting an immune response when introduced into a living organism, for example, a 
mammal. The term "immunogenic fragment" also includes any polypeptide or oligopeptide fragment 
of TXREG which is useful in any of the antibody production methods disclosed herein or known in 

30 the art. 

The term "microarray" refers to an arrangement of a plurality of polynucleotides, 
polypeptides, or other chemical compounds on a substrate. 

The terms "element" and "array element" refer to a polynucleotide, polypeptide, or other 
chemical compound having a unique and defined position on a microarray. 
35 The term "modulate" refers to a change in the activity of TXREG. For example, modulation 
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may cause an increase or a decrease in protein activity, binding characteristics, or any other 
biological, functional, or immunological properties of TXREG. 

The phrases "nucleic acid" and "nucleic acid sequence'* refer to a nucleotide, oligonucleotide, 
polynucleotide, or any fragment thereof. These phrases also refer to DNA or RNA of genomic or 
5 synthetic origin which may be single-stranded or double-stranded and may represent the sense or the 
antisense strand, to peptide nucleic acid (PNA), or to any DNA-like or RNA-like material. 

"Operably linked" refers to the situation in which a first nucleic acid sequence is placed in a 
functional relationship with a second nucleic acid sequence. For instance, a promoter is operably 
linked to a coding sequence if the promoter affects the transcription or expression of the coding 

10 sequence. Operably linked DNA sequences may be in close proximity or contiguous and, where 
necessary to join two protein coding regions, in the same reading frame. 

"Peptide nucleic acid" (PNA) refers to an antisense molecule or anti-gene agent which 
comprises an oligonucleotide of at least about 5 nucleotides in length linked to a peptide backbone of 
amino acid residues ending in lysine. The terminal lysine confers solubility to the composition. 

15 PNAs preferentially bind complementary single stranded DNA or RNA and stop transcript 
elongation, and may be pegylated to extend their lifespan in the cell. 

"Post-translational modification" of an TXREG may involve lipidation, glycosylation, 
phosphorylation, acetylation, racemization, proteolytic cleavage, and other modifications known in 
the art. These processes may occur synthetically or biochemically. Biochemical modifications will 

20 vary by cell type depending on the enzymatic milieu of TXREG. 

"Probe" refers to nucleic acid sequences encoding TXREG, their complements, or fragments 
thereof, which are used to detect identical, allelic or related nucleic acid sequences. Probes are 
isolated oligonucleotides or polynucleotides attached to a detectable label or reporter molecule. 
Typical labels include radioactive isotopes, ligands, chemiluminescent agents, and enzymes. 

25 "Primers" are short nucleic acids, usually DNA oligonucleotides, which may be annealed to a target 
polynucleotide by complementary base-pairing. The primer may then be extended along the target 
DNA strand by a DNA polymerase enzyme. Primer pairs can be used for amplification (and 
identification) of a nucleic acid sequence, e.g., by the polymerase chain reaction (PCR). 

Probes and primers as used in the present invention typically comprise at least 15 contiguous 

30 nucleotides of a known sequence. In order to enhance specificity, longer probes and primers may also 
be employed, such as probes and primers that comprise at least 20, 25, 30, 40, 50, 60, 70, 80, 90, 100, 
or at least 150 consecutive nucleotides of the disclosed nucleic acid sequences. Probes and primers 
may be considerably longer than these examples, and it is understood that any length supported by the 
specification, including the tables, figures, and Sequence Listing, may be used. 

35 Methods for preparing and using probes and primers are described in the references, for 
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example Sambrook, J. et aL, 1989, Molecular Cloning: A Laboratory Manual , 2 nd ed., vol. 1-3, Cold 
Spring Harbor Press, Plainview NY; Ausubel, F.M. et al.,1987, Current Protocols in Molecular 
Biology , Greene Publ. Assoc. & Wiley-Intersciences, New York NY; Innis, M. et aL, 1990, PCR 
Protocols, A Guide to Methods and Applications , Academic Press, San Diego CA. PCR primer pairs 
5 can be derived from a known sequence, for example, by using computer programs intended for that 
purpose such as Primer (Version 0.5, 1991, Whitehead Institute for Biomedical Research, Cambridge 
MA). 

Oligonucleotides for use as primers are selected using software known in the art for such 
purpose. For example, OLIGO 4.06 software is useful for the selection of PCR primer pairs of up to 

10 100 nucleotides each, and for the analysis of oligonucleotides and larger polynucleotides of up to 
5,000 nucleotides from an input polynucleotide sequence of up to 32 kilobases. Similar primer 
selection programs have incorporated additional features for expanded capabilities. For example, the 
PrimOU primer selection program (available to the public from the Genome Center at University of 
Texas South West Medical Center, Dallas TX) is capable of choosing specific primers from 

15 megabase sequences and is thus useful for designing primers on a genome-wide scope. The Primer3 
primer selection program (available to the public from the Whitehead Institute/MIT Center for 
Genome Research, Cambridge MA) allows the user to input a "mispriming library," in which 
sequences to avoid as primer binding sites are user-specified. Primer3 is useful, in particular, for the 
selection of oligonucleotides for microarrays. (The source code for the latter two primer selection 

20 programs may also be obtained from their respective sources and modified to meet the user's specific 
needs.) The PrimeGen program (available to the public from the UK Human Genome Mapping 
Project Resource Centre, Cambridge UK) designs primers based on multiple sequence alignments, 
thereby allowing selection of primers that hybridize to either the most conserved or least conserved 
regions of aligned nucleic acid sequences. Hence, this program is useful for identification of both 

25 unique and conserved oligonucleotides and polynucleotide fragments. The oligonucleotides and 

polynucleotide fragments identified by any of the above selection methods are useful in hybridization 
technologies, for example, as PCR or sequencing primers, microarray elements, or specific probes to 
identify fully or partially complementary polynucleotides in a sample of nucleic acids. Methods of 
oligonucleotide selection are not limited to those described above. 

30 A "recombinant nucleic acid" is a sequence that is not naturally occurring or has a sequence 

that is made by an artificial combination of two or more otherwise separated segments of sequence. 
This artificial combination is often accomplished by chemical synthesis or, more commonly, by the 
artificial manipulation of isolated segments of nucleic acids, e.g., by genetic engineering techniques 
such as those described in Sambrook, supra . The term recombinant includes nucleic acids that have 
35 been altered solely by addition, substitution, or deletion of a portion of the nucleic acid. Frequently, a 
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recombinant nucleic acid may include a nucleic acid sequence operably linked to a promoter 
sequence. Such a recombinant nucleic acid may be part of a vector that is used, for example, to 
transform a cell. 

Alternatively, such recombinant nucleic acids may be part of a viral vector, e.g., based on a 
5 vaccinia virus, that could be use to vaccinate a mammal wherein the recombinant nucleic acid is 
expressed, inducing a protective immunological response in the mammal. 

A "regulatory element" refers to a nucleic acid sequence usually derived from untranslated 
regions of a gene and includes enhancers, promoters, introns, and 5' and 3' untranslated regions 
(UTRs). Regulatory elements interact with host or viral proteins which control transcription, 
10 translation, or RNA stability. 

"Reporter molecules" are chemical or biochemical moieties used for labeling a nucleic acid, 
amino acid, or antibody. Reporter molecules include radionuclides; enzymes; fluorescent, 
chemiluminescent, or chromogenic agents; substrates; cofactors; inhibitors; magnetic particles; and 
other moieties known in the art. 
15 An "RNA equivalent," in reference to a DNA sequence, is composed of the same linear 

sequence of nucleotides as the reference DNA sequence with the exception that all occurrences of the 
nitrogenous base thymine are replaced with uracil, and the sugar backbone is composed of ribose 
instead of deoxyribose. 

The term "sample" is used in its broadest sense. A sample suspected of containing nucleic 
20 acids encoding TXREG, or fragments thereof, or TXREG itself, may comprise a bodily fluid; an 

extract from a cell, chromosome, organelle, or membrane isolated from a cell; a cell; genomic DNA, 
RNA, or cDNA, in solution or bound to a substrate; a tissue; a tissue print; etc. 

The terms "specific binding" and "specifically binding" refer to that interaction between a 
protein or peptide and an agonist, an antibody, an antagonist, a small molecule, or any natural or 
25 synthetic binding composition. The interaction is dependent upon the presence of a particular 
structure of the protein, e.g., the antigenic determinant or epitope, recognized by the binding 
molecule. For example, if an antibody is specific for epitope "A," the presence of a polypeptide 
comprising the epitope A, or the presence of free unlabeled A, in a reaction containing free labeled A 
and the antibody will reduce the amount of labeled A that binds to the antibody. 
30 The term "substantially purified" refers to nucleic acid or amino acid sequences that are 

removed from their natural environment and are isolated or separated, and are at least 60% free, 
preferably at least 75% free, and most preferably at least 90% free from other components with which 
they are naturally associated. 

A "substitution" refers to the replacement of one or more amino acid residues or nucleotides 
35 by different amino acid residues or nucleotides, respectively. 
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"Substrate" refers to any suitable rigid or semi-rigid support including membranes, filters, 
chips, slides, wafers, fibers, magnetic or nonmagnetic beads, gels, tubing, plates, polymers, 
microparticles and capillaries. The substrate can have a variety of surface forms, such as wells, 
trenches, pins, channels and pores, to which polynucleotides or polypeptides are bound. 
5 A "transcript image" refers to the collective pattern of gene expression by a particular cell 

type or tissue under given conditions at a given time. 

"Transformation" describes a process by which exogenous DNA is introduced into a recipient 
cell. Transformation may occur under natural or artificial conditions according to various methods 
well known in the art, and may rely on any known method for the insertion of foreign nucleic acid 
10 sequences into a prokaryotic or eukaryotic host cell. The method for transformation is selected based 
on the type of host cell being transformed and may include, but is not limited to, bacteriophage or 
viral infection, electroporation, heat shock, lipofection, and particle bombardment. The term 
"transformed" cells includes stably transformed cells in which the inserted DNA is capable of 
replication either as an autonomously replicating plasmid or as part of the host chromosome, as well 
15 as transiently transformed cells which express the inserted DNA or RNA for limited periods of time. 
A "transgenic organism," as used herein, is any organism, including but not limited to 
animals and plants, in which one or more of the cells of the organism contains heterologous nucleic 
acid introduced by way of human intervention, such as by transgenic techniques well known in the 
art. The nucleic acid is introduced into the cell, directly or indirectly by introduction into a precursor 
20 of the cell, by way of deliberate genetic manipulation, such as by microinjection or by infection with 
a recombinant virus. The term genetic manipulation does not include classical cross-breeding, or in 
vitro fertilization, but rather is directed to the introduction of a recombinant DNA molecule. The 
transgenic organisms contemplated in accordance with the present invention include bacteria, 
cyanobacteria, fungi, plants, and animals. The isolated DNA of the present invention can be 
25 introduced into the host by methods known in the art, for example infection, transfection, 

transformation or transconjugation. Techniques for transferring the DNA of the present invention 
into such organisms are widely known and provided in references such as Sambrook et al. (1989), 
supra . 

A "variant" of a particular nucleic acid sequence is defined as a nucleic acid sequence having 
30 at least 40% sequence identity to the particular nucleic acid sequence over a certain length of one of 
the nucleic acid sequences using blastn with the "BLAST 2 Sequences" tool Version 2.0.9 (May-07- 
1999) set at default parameters. Such a pair of nucleic acids may show, for example, at least 50%, at 
least 60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 95% or at least 98% or 
greater sequence identity over a certain defined length. A variant may be described as, for example, 
35 an "allelic" (as defined above), "splice," "species," or "polymorphic" variant. A splice variant may 
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have significant identity to a reference molecule, but will generally have a greater or lesser number of 
polynucleotides due to alternative splicing of exons during mRNA processing. The corresponding 
polypeptide may possess additional functional domains or lack domains that are present in the 
reference molecule. Species variants are polynucleotide sequences that vary from one species to 
5 another. The resulting polypeptides generally will have significant amino acid identity relative to 
each other. A polymorphic variant is a variation in the polynucleotide sequence of a particular gene 
between individuals of a given species. Polymorphic variants also may encompass "single nucleotide 
polymorphisms" (SNPs) in which the polynucleotide sequence varies by one nucleotide base. The 
presence of SNPs may be indicative of, for example, a certain population, a disease state, or a 

10 propensity for a disease state. 

A "variant" of a particular polypeptide sequence is defined as a polypeptide sequence having 
at least 40% sequence identity to the particular polypeptide sequence over a certain length of one of 
the polypeptide sequences using blastp with the "BLAST 2 Sequences" tool Version 2.0.9 (May-07- 
1999) set at default parameters. Such a pair of polypeptides may show, for example, at least 50%, at 

15 least 60%, at least 70%, at least 80%, at least 90%, at least 95%, or at least 98% or greater sequence 
identity over a certain defined length of one of the polypeptides. 
THE INVENTION 

The invention is based on the discovery of new human transcriptional regulator proteins 
(TXREG), the polynucleotides encoding TXREG, and the use of these compositions for the 

20 diagnosis, treatment, or prevention of cell proliferative, autoimmune/inflammatory, and 
developmental disorders. 

Table 1 lists the Incyte clones used to assemble full length nucleotide sequences encoding 
TXREG. Columns 1 and 2 show the sequence identification numbers (SEQ ID NOs) of the 
polypeptide and nucleotide sequences, respectively. Column 3 shows the clone IDs of the Incyte 

25 clones in which nucleic acids encoding each TXREG were identified, and column 4 shows the cDNA 
libraries from which these clones were isolated. Column 5 shows Incyte clones and their 
corresponding cDNA libraries. Clones for which cDNA libraries are not indicated were derived from 
pooled cDNA libraries. The Incyte clones in column 5 were used to assemble the consensus 
nucleotide sequence of each TXREG and are useful as fragments in hybridization technologies. 

30 The columns of Table 2 show various properties of each of the polypeptides of the invention: 

column 1 references the SEQ ID NO; column 2 shows the number of amino acid residues in each 
polypeptide; column 3 shows potential phosphorylation sites; column 4 shows potential glycosylation 
sites; column 5 shows the amino acid residues comprising signature sequences and motifs; column 6 
shows homologous sequences as identified by BLAST analysis; and column 7 shows analytical 

35 methods and in some cases, searchable databases to which the analytical methods were applied. The 
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methods of column 7 were used to characterize each polypeptide through sequence homology and 
protein motifs. 

The columns of Table 3 show the tissue-specificity and diseases, disorders, or conditions 
associated with nucleotide sequences encoding TXREG. The first column of Table 3 lists the 

5 nucleotide SEQ ED NOs. Column 2 lists fragments of the nucleotide sequences of column 1. These 
fragments are useful, for example, in hybridization or amplification technologies to identify SEQ ID 
NO:33-64 and to distinguish between SEQ ED NO:33-64 and related polynucleotide sequences. The 
polypeptides encoded by these fragments are useful, for example, as immunogenic peptides. Column 
3 lists tissue categories which express TXREG as a fraction of total tissues expressing TXREG. 

10 Column 4 lists diseases, disorders, or conditions associated with those tissues expressing TXREG as a 
fraction of total tissues expressing TXREG. Column 5 lists the vectors used to subclone each cDNA 
library. 

The columns of Table 4 show descriptions of the tissues used to construct the cDNA libraries 
from which cDNA clones encoding TXREG were isolated. Column 1 references the nucleotide SEQ 

15 ED NOs, column 2 shows the cDNA libraries from which these clones were isolated, and column 3 
shows the tissue origins and other descriptive information relevant to the cDNA libraries in column 2. 

SEQ ID NO:33 maps to chromosome 1 within the interval from 199.2 to 203.0 centiMorgans, 
to chromosome 6 within the interval from 59.6 to 73.9 centiMorgans, and to chromosome 13 within 
the interval from 1 12.8 to 1 17.5 centiMorgans. The interval on chromosome 6 from 59.6 to 73.9 

20 centiMorgans also contains genes associated with methylmalonic CoA mutase deficiency and retinal 
degeneration. The interval on chromosome 13 from 1 12.8 to 1 17.5 centiMorgans also contains genes 
associated with Oguchi disease (night blindness) and Factor X deficiency. SEQ ID NO:34 maps to 
chromosome 13 within the interval from 1 12.8 to 1 17.5 centiMorgans. This interval also contains 
genes associated with Oguchi disease (night blindness) and Factor X deficiency. SEQ ID NO:35 

25 maps to chromosome 12 within the interval from 1 13.3 to 126.1 centiMorgans. This interval also 
contains genes associated with spinocerebellar ataxia, mevalonate kinase deficiency, alcohol 
intolerance, and myocardial hypertrophy. SEQ ID NO:36 maps to chromosome 1 within the interval 
from 155.2 to 157.4 centiMorgans, and to chromosome 16 within the interval from 83.7 to 86.6 
centiMorgans. The interval on chromosome 1 from 155.2 to 157.4 centiMorgans also contains genes 

30 associated with leukemia and adrenal hyperplasia. The interval on chromosome 16 from 83.7 to 86.6 
centiMorgans also contains a gene associated with Cortisol 1 1-beta-keto reductase deficiency. SEQ 
ID NO:38 maps to chromosome 9 within the interval from 59.9 to 64.5 centiMorgans. SEQ ID 
NO:40 maps to chromosome 18 within the interval from 61 .2 to 63.2 centiMorgans. SEQ ID NO:44 
maps to chromosome 2 within the interval from 180.6 to 188.2 centiMorgans. This interval also 

35 contains a gene associated with glutamate decarboxylase deficiency. SEQ ID NO:45 maps to 
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chromosome 13 within the interval from 1 12.8 to 1 17.5 centiMorgans. This interval also contains 
genes associated with Oguchi disease (night blindness) and Factor X deficiency. SEQ ID NO:47 
maps to chromosome 8 within the interval from 75.0 to 90.2 centiMorgans. This interval also 
contains genes associated with branchiootorenal dysplasia and Zellweger syndrome. SEQ ID NO:61 
5 maps to chromosome 5 within the interval from 63.9 to 69.6 centiMorgans. SEQ ID NO:62 maps to 
chromosome 7 within the interval from 120.7 to 123.9 centiMorgans. This interval also contains 
genes associated with lipoamide dehydrogenase deficiency, neonatal cutis laxa, and tumor 
suppression. SEQ ID NO:64 maps to chromosome 1 within the interval from 157.4 to 186.4 
centiMorgans, to chromosome 5 within the interval from 175.3 to 182.4 centiMorgans, and to 
10 chromosome 14 within the interval from 7.5 to 21 .9 centiMorgans. The interval on chromosome 1 
from 157.4 to 186.4 centiMorgans also contains genes associated with autoimmune diseases, 
leukemia, and Gaucher disease. The interval on chromosome 14 from 7.5 to 21.9 centiMorgans also 
contains genes associated with apoptosis, hypertrophic cardiomopathy, and oculopharyngeal 
muscular dystrophy. 

15 The invention also encompasses TXREG variants. A preferred TXREG variant is one which 

has at least about 80%, or alternatively at least about 90%, or even at least about 95% amino acid 
sequence identity to the TXREG amino acid sequence, and which contains at least one functional or 
structural characteristic of TXREG. 

The invention also encompasses polynucleotides which encode TXREG. In a particular 

20 embodiment, the invention encompasses a polynucleotide sequence comprising a sequence selected 
from the group consisting of SEQ ID NO:33-64, which encodes TXREG. The polynucleotide 
sequences of SEQ ID NO:33-64, as presented in the Sequence Listing, embrace the equivalent RNA 
sequences, wherein occurrences of the nitrogenous base thymine are replaced with uracil, and the 
sugar backbone is composed of ribose instead of deoxyribose. 

25 The invention also encompasses a variant of a polynucleotide sequence encoding TXREG. In 

particular, such a variant polynucleotide sequence will have at least about 70%, or alternatively at 
least about 85%, or even at least about 95% polynucleotide sequence identity to the polynucleotide 
sequence encoding TXREG. A particular aspect of the invention encompasses a variant of a 
polynucleotide sequence comprising a sequence selected from the group consisting of SEQ ID 

30 NO:33-64 which has at least about 70%, or alternatively at least about 85%, or even at least about 
95% polynucleotide sequence identity to a nucleic acid sequence selected from the group consisting 
of SEQ ID NO:33-64. Any one of the polynucleotide variants described above can encode an amino 
acid sequence which contains at least one functional or structural characteristic of TXREG. 

It will be appreciated by those skilled in the art that as a result of the degeneracy of the 

35 genetic code, a multitude of polynucleotide sequences encoding TXREG, some bearing minimal 
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similarity to the polynucleotide sequences of any known and naturally occurring gene, may be 
produced. Thus, the invention contemplates each and every possible variation of polynucleotide 
sequence that could be made by selecting combinations based on possible codon choices. These 
combinations are made in accordance with the standard triplet genetic code as applied to the 

5 polynucleotide sequence of naturally occurring TXREG, and all such variations are to be considered 
as being specifically disclosed. 

Although nucleotide sequences which encode TXREG and its variants are generally capable 
of hybridizing to the nucleotide sequence of the naturally occurring TXREG under appropriately 
selected conditions of stringency, it may be advantageous to produce nucleotide sequences encoding 

10 TXREG or its derivatives possessing a substantially different codon usage, e.g., inclusion of non- 
naturally occurring codons. Codons may be selected to increase the rate at which expression of the 
peptide occurs in a particular prokaryotic or eukaryotic host in accordance with the frequency with 
which particular codons are utilized by the host. Other reasons for substantially altering the 
nucleotide sequence encoding TXREG and its derivatives without altering the encoded amino acid 

15 sequences include the production of RNA transcripts having more desirable properties, such as a 
greater half-life, than transcripts produced from the naturally occurring sequence. 

The invention also encompasses production of DNA sequences which encode TXREG and 
TXREG derivatives, or fragments thereof, entirely by synthetic chemistry. After production, the 
synthetic sequence may be inserted into any of the many available expression vectors and cell 

20 systems using reagents well known in the art. Moreover, synthetic chemistry may be used to 
introduce mutations into a sequence encoding TXREG or any fragment thereof. 

Also encompassed by the invention are polynucleotide sequences that are capable of 
hybridizing to the claimed polynucleotide sequences, and, in particular, to those shown in SEQ ID 
NO:33-64 and fragments thereof under various conditions of stringency. (See, e.g., Wahl, G.M. and 

25 S.L. Berger (1987) Methods Enzymol. 152:399-407; Kimmel, A.R. (1987) Methods Enzymoi. 

152:507-51 1.) Hybridization conditions, including annealing and wash conditions, are described in 
"Definitions." 

Methods for DNA sequencing are well known in the art and may be used to practice any of 
the embodiments of the invention. The methods may employ such enzymes as the Klenow fragment 

30 of DNA polymerase I, SEQUENASE (US Biochemical, Cleveland OH), Taq polymerase (PE 
Biosystems, Foster City CA), thermostable T7 polymerase (Amersham Pharmacia Biotech, 
Piscataway NJ), or combinations of polymerases and proofreading exonucleases such as those found 
in the ELONGASE amplification system (Life Technologies, Gaithersburg MD). Preferably, 
sequence preparation is automated with machines such as the MICROLAB 2200 liquid transfer 

35 system (Hamilton, Reno NV), PTC200 thermal cycler (MJ Research, Watertown MA) and ABI 
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CATALYST 800 thermal cycler (PE Biosystems). Sequencing is then carried out using either the 
ABI 373 or 377 DNA sequencing system (PE Biosystems), the MEGABACE 1000 DNA sequencing 
system (Molecular Dynamics, Sunnyvale CA), or other systems known in the art. The resulting 
sequences are analyzed using a variety of algorithms which are well known in the art. (See, e.g., 
5 Ausubel, F.M. (1997) Short Protocols in Molecular Biology , John Wiley & Sons, New York NTY, unit 
7.7; Meyers, R.A. (1995) Molecular Bioloev and Biotechnology , Wiley VCH, New York NY, pp. 
856-853.) % 

The nucleic acid sequences encoding TXREG may be extended utilizing a partial nucleotide 
sequence and employing various PCR-based methods known in the art to detect upstream sequences, 

10 such as promoters and regulatory elements. For example, one method which may be employed, 

restriction-site PCR, uses universal and nested primers to amplify unknown sequence from genomic 
DNA within a cloning vector. (See, e.g., Sarkar, G. (1993) PCR Methods Applic. 2:31 8-322.) 
Another method, inverse PCR, uses primers that extend in divergent directions to amplify unknown 
sequence from a circularized template. The template is derived from restriction fragments comprising 

15 a known genomic locus and surrounding sequences. (See, e.g., Triglia, T. et al. (1988) Nucleic Acids 
Res. 16:8186.) A third method, capture PCR, involves PCR amplification of DNA fragments 
adjacent to known sequences in human and yeast artificial chromosome DNA. (See, e.g., Lagerstrom, 
M. et al. (1991) PCR Methods Applic. 1:111-1 19.) In this method, multiple restriction enzyme 
digestions and ligations may be used to insert an engineered double-stranded sequence into a region 

20 of unknown sequence before performing PCR. Other methods which may be used to retrieve 

unknown sequences are known in the art. (See, e.g., Parker, J.D. et al. (1991) Nucleic Acids Res. 
19:3055-3060). Additionally, one may use PCR, nested primers, and PROMOTERFINDER libraries 
(Clontech, Palo Alto CA) to walk genomic DNA. This procedure avoids the need to screen libraries 
and is useful in finding intron/exon junctions. For all PCR-based methods, primers may be designed 

25 using commercially available software, such as OLIGO 4.06 Primer Analysis software (National 
Biosciences, Plymouth MN) or another appropriate program, to be about 22 to 30 nucleotides in 
length, to have a GC content of about 50% or more, and to anneal to the template at temperatures of 
about 68°C to 72°C. 

When screening for full-length cDNAs, it is preferable to use libraries that have been 
30 size-selected to include larger cDNAs. In addition, random-primed libraries, which often include 
sequences containing the 5' regions of genes, are preferable for situations in which an oligo d(T) 
library does not yield a full-length cDNA. Genomic libraries may be useful for extension of sequence 
into 5' non-transcribed regulatory regions. 

Capillary electrophoresis systems which are commercially available may be used to analyze 
35 the size or confirm the nucleotide sequence of sequencing or PCR products. In particular, capillary 
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sequencing may employ flowable polymers for electrophoretic separation, four different nucleotide- 
specific, laser-stimulated fluorescent dyes, and a charge coupled device camera for detection of the 
emitted wavelengths. Output/light intensity may be converted to electrical signal using appropriate 
software (e.g., GENOTYPER and SEQUENCE NAVIGATOR, PE Biosystems), and the entire 
5 process from loading of samples to computer analysis and electronic data display may be computer 
controlled. Capillary electrophoresis is especially preferable for sequencing small DNA fragments 
which may be present in limited amounts in a particular sample. 

In another embodiment of the invention, polynucleotide sequences or fragments thereof 
which encode TXREG may be cloned in recombinant DNA molecules that direct expression of 
10 TXREG, or fragments or functional equivalents thereof, in appropriate host cells. Due to the inherent 
degeneracy of the genetic code, other DNA sequences which encode substantially the same or a 
functionally equivalent amino acid sequence may be produced and used to express TXREG. 

The nucleotide sequences of the present invention can be engineered using methods generally 
known in the art in order to alter TXREG-encoding sequences for a variety of purposes including, but 
15 not limited to, modification of the cloning, processing, and/or expression of the gene product. DNA 
shuffling by random fragmentation and PCR reassembly of gene fragments and synthetic 
oligonucleotides may be used to engineer the nucleotide sequences. For example, oligonucleotide- 
mediated site-directed mutagenesis may be used to introduce mutations that create new restriction 
sites, alter glycosylation patterns, change codon preference, produce splice variants, and so forth. 
20 The nucleotides of the present invention may be subjected to DNA shuffling techniques such 

as MOLECULARBREEDING (Maxygen Inc., Santa Clara CA; described in U.S. Patent Number 
5,837,458; Chang, C.-C. et al. (1999) Nat. Biotechnol. 17:793-797; Christians, F.C. et al. (1999) Nat. 
Biotechnol. 17:259-264; and Crameri, A. et al. (1996) Nat. Biotechnol. 14:315-319) to alter or 
improve the biological properties of TXREG, such as its biological or enzymatic activity or its ability 
25 to bind to other molecules or compounds. DNA shuffling is a process by which a library of gene 
variants is produced using PCR-mediated recombination of gene fragments. The library is then 
subjected to selection or screening procedures that identify those gene variants with the desired 
properties. These preferred variants may then be pooled and further subjected to recursive rounds of 
DNA shuffling and selection/screening. Thus, genetic diversity is created through "artificial" 
30 breeding and rapid molecular evolution. For example, fragments of a single gene containing random 
point mutations may be recombined, screened, and then reshuffled until the desired properties are 
optimized." Alternatively, fragments of a given gene may be recombined with fragments of 
homologous genes in the same gene family, either from the same or different species, thereby 
maximizing the genetic diversity of multiple naturally occurring genes in a directed and controllable 
35 manner. 
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In another embodiment, sequences encoding TXREG may be synthesized, in whole or in part, 
using chemical methods well known in the art. (See, e.g., Caruthers, M.H. et al. (1980) Nucleic Acids 
Symp. Ser. 7:215-223; and Horn, T. et al. (1980) Nucleic Acids Symp. Ser. 7:225-232.) 
Alternatively, TXREG itself or a fragment thereof may be synthesized using chemical methods. For 
5 example, peptide synthesis can be performed using various solution-phase or solid-phase techniques. 
(See, e.g., Creighton, T. (1984) Proteins. Structures and Molecular Properties . WH Freeman, New 
York NY, pp. 55-60; and Roberge, J.Y. et at. (1995) Science 269:202-204.) Automated synthesis 
may be achieved using the ABI 431 A peptide synthesizer (PE Biosystems). Additionally, the amino 
acid sequence of TXREG, or any part thereof, may be altered during direct synthesis and/or combined 

10 with sequences from other proteins, or any part thereof, to produce a variant polypeptide or a 
polypeptide having a sequence of a naturally occurring polypeptide. 

The peptide may be substantially purified by preparative high performance liquid 
chromatography. (See, e.g., Chiez, R.M. and F.Z. Regnier (1990) Methods Enzymol. 182:392-421.) 
The composition of the synthetic peptides may be confirmed by amino acid analysis or by 

15 sequencing. (See, e.g., Creighton, supra , pp. 28-53.) 

In order to express a biologically active TXREG, the nucleotide sequences encoding TXREG 
or derivatives thereof may be inserted into an appropriate expression vector, i.e., a vector which 
contains the necessary elements for transcriptional and translational control of the inserted coding 
sequence in a suitable host. These elements include regulatory sequences, such as enhancers, 

20 constitutive and inducible promoters, and 5' and 3' untranslated regions in the vector and in 
polynucleotide sequences encoding TXREG. Such elements may vary in their strength and 
specificity. Specific initiation signals may also be used to achieve more efficient translation of 
sequences encoding TXREG. Such signals include the ATG initiation codon and adjacent sequences, 
e.g. the Kozak sequence. In cases where sequences encoding TXREG and its initiation codon and 

25 upstream regulatory sequences are inserted into the appropriate expression vector, no additional 

transcriptional or translational control signals may be needed. However, in cases where only coding 
sequence, or a fragment thereof, is inserted, exogenous translational control signals including an in- 
frame ATG initiation codon should be provided by the vector. Exogenous translational elements and 
initiation codons may be of various origins, both natural and synthetic. The efficiency of expression 

30 may be enhanced by the inclusion of enhancers appropriate for the particular host cell system used. 
(See, e.g., Scharf, D. et al. (1994) Results Probl. Cell Differ. 20:125-162.) 

Methods which are well known to those skilled in the art may be used to construct expression 
vectors containing sequences encoding TXREG and appropriate transcriptional and translational 
control elements. These methods include in vitro recombinant DNA techniques, synthetic techniques, 

35 and in vivo genetic recombination. (See, e.g., Sambrook, J. et al. (1989) Molecular Cloning. A 
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Laboratory Manual , Cold Spring Harbor Press, Plainview NY, ch. 4, 8, and 16-17; Ausubel, F.M. et 
aL (1995) Current Protocols in Molecular Biology . John Wiley & Sons, New York NY, ch. 9, 13, and 
16.) 

A variety of expression vector/host systems may be utilized to contain and express sequences 

5 encoding TXREG. These include, but are not limited to, microorganisms such as bacteria 

transformed with recombinant bacteriophage, plasmid, or cosmid DNA expression vectors; yeast 
transformed with yeast expression vectors; insect cell systems infected with viral expression vectors 
(e.g., baculovirus); plant cell systems transformed with viral expression vectors (e.g., cauliflower 
mosaic virus, CaMV, or tobacco mosaic virus, TMV) or with bacterial expression vectors (e.g., Ti or 

10 pBR322 plasmids); or animal cell systems. (See, e.g., Sambrook, supra ; Ausubel, supra ; Van Heeke, 
G. and S.M. Schuster (1989) J. Biol. Chem. 264:5503-5509; Bitter, G.A. et al. (1987) Methods 
Enzymol. 153:516-544; Scorer, C.A. et al. (1994) Bio/Technology 12:181-184; Engelhard, E.K. et al. 
(1994) Proc. Natl. Acad. Sci. USA 91:3224-3227; Sandig, V. et al. (1996) Hum. Gene Ther. 7:1937- 
1945; Takamatsu, N. (1987) EMBO J. 6:307-311; Coruzzi, G. et al. (1984) EMBO J. 3:1671-1680; 

15 Broglie, R. et al. (1984) Science 224:838-843; Winter, J. et al. (1991) Results Probl. Cell Differ. 

17:85-105; The McGraw Hill Yearbook of Science and Technology (1992) McGraw Hill, New York 
NY, pp. 191-196; Logan, J. and T. Shenk (1984) Proc. Natl. Acad. Sci. USA 81 .3655-3659; and 
Harrington, J.J. et al. (1997) Nat. Genet. 15:345-355.) Expression vectors derived from retroviruses, 
adenoviruses, or herpes or vaccinia viruses, or from various bacterial plasmids, may be used for 

20 delivery of nucleotide sequences to the targeted organ, tissue, or cell population. (See, e.g., Di 

Nicola, M. et al. (1998) Cancer Gen. Ther. 5(6):350-356; Yu, M. et al., (1993) Proc. Natl. Acad. Sci. 
USA 90(1 3):6340-6344; Buller, R.M. et al. (1985) Nature 317(6040):813-815; McGregor, D.P. et al. 
(1994) Mol. Immunol. 3 1(3):2 19-226; and Verma, LM. and N. Somia (1997) Nature 389:239-242.) 
The invention is not limited by the host cell employed. 

25 In bacterial systems, a number of cloning and expression vectors may be selected depending 

upon the use intended for polynucleotide sequences encoding TXREG. For example, routine cloning, 
subcloning, and propagation of polynucleotide sequences encoding TXREG can be achieved using a 
multifunctional E. coli vector such as PBLUESCRIPT (Stratagene, La Jolla CA) or PSPORT1 
plasmid (Life Technologies). Ligation of sequences encoding TXREG into the vector's multiple 

30 cloning site disrupts the lacL gene, allowing a colorimetric screening procedure for identification of 
transformed bacteria containing recombinant molecules. In addition, these vectors may be useful for 
in vitro transcription, dideoxy sequencing, single strand rescue with helper phage, and creation of 
nested deletions in the cloned sequence. (See, e.g., Van Heeke, G. and S.M. Schuster (1989) J. Biol. 
Chem. 264:5503-5509.) When large quantities of TXREG are needed, e.g. for the production of 

35 antibodies, vectors which direct high level expression of TXREG may be used. For example, vectors 
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containing the strong, inducible T5 or T7 bacteriophage promoter may be used. 

Yeast expression systems may be used for production of TXREG. A number of vectors 
containing constitutive or inducible promoters, such as alpha factor, alcohol oxidase, and PGH 
promoters, may be used in the yeast Saccharomvces cerevisiae or Pichia pastoris . In addition, such 
5 vectors direct either the secretion or intracellular retention of expressed proteins and enable 

integration of foreign sequences into the host genome for stable propagation. (See, e.g., Ausubel, 
1 1995, supra ; Bitter, supra ; and Scorer, supra .) 

Plant systems may also be used for expression of TXREG. Transcription of sequences 
encoding TXREG may be driven viral promoters, e.g., the 35S and 19S promoters of CaMV used 
10 alone or in combination with the omega leader sequence from TMV (Takamatsu, N. (1987) EMBO J. 
6:307-31 1). Alternatively, plant promoters such as the small subunit of RUBISCO or heat shock 
promoters may be used. (See, e.g., Coruzzi, supra ; Broglie, supra ; and Winter, su pra .) These 
constructs can be introduced into plant cells by direct DNA transformation or pathogen-mediated 
transfection. (See, e.g., The McGraw Hill Yearbook of Science and Technology (1992) McGraw 
15 Hill, New York NY, pp. 191-196.) 

In mammalian cells, a number of viral-based expression systems may be utilized. In cases 
where an adenovirus is used as an expression vector, sequences encoding TXREG may be ligated into 
an adenovirus transcription/translation complex consisting of the late promoter and tripartite leader 
sequence. Insertion in a non-essential El or E3 region of the viral genome may be used to obtain 
20 infective virus which expresses TXREG in host cells. (See, e.g., Logan, J. and T. Shenk (1984) Proc. 
Natl. Acad. Sci. USA 81:3655-3659.) In addition, transcription enhancers, such as the Rous sarcoma 
virus (RSV) enhancer, may be used to increase expression in mammalian host cells. SV40 or EBV- 
based vectors may also be used for high-level protein expression. 

Human artificial chromosomes (HACs) may also be employed to deliver larger fragments of 
25 DNA than can be contained in and expressed from a plasmid. HACs of about 6 kb to 10 Mb are 
constructed and delivered via conventional delivery methods (liposomes, polycationic amino 
polymers, or vesicles) for therapeutic purposes. (See, e.g., Harrington, J.J. et al. (1997) Nat. Genet. 
15:345-355.) 

For long term production of recombinant proteins in mammalian systems, stable expression 
30 of TXREG in cell lines is preferred. For example, sequences encoding TXREG can be transformed 
into cell lines using expression vectors which may contain viral origins of replication and/or 
endogenous expression elements and a selectable marker gene on the same or on a separate vector. 
Following the introduction of the vector, cells may be allowed to grow for about 1 to 2 days in 
enriched media before being switched to selective media. The purpose of the selectable marker is to 
35 confer resistance to a selective agent, and its presence allows growth and recovery of cells which 
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successfully express the introduced sequences. Resistant clones of stably transformed cells may be 
propagated using tissue culture techniques appropriate to the cell type. 

Any number of selection systems may be used to recover transformed cell lines. These 
include, but are not limited to, the herpes simplex virus thymidine kinase and adenine 

5 phosphoribosyltransferase genes, for use in tk and apr cells, respectively. (See, e.g., Wigler, M. et 
al. (1977) Cell 1 1:223-232; Lowy, I. et al. (1980) Cell 22:817-823.) Also, antimetabolite, antibiotic, 
or herbicide resistance can be used as the basis for selection. For example, dhfr confers resistance to 
methotrexate; neo confers resistance to the aminoglycosides neomycin and G-418; and als and pat 
confer resistance to chlorsulfuron and phosphinotricin acetyltransferase, respectively. (See, e.g., 

10 Wigler, M. et al. (1980) Proc. Natl. Acad. Sci. USA 77:3567-3570; Colbere-Garapin, F. et al. (1981) 
J. Mol. Biol. 150: 1-14.) Additional selectable genes have been described, e.g., trpB and hisD, which 
alter cellular requirements for metabolites. (See, e.g., Hartman, S.C. and R.C. Mulligan (1988) Proc. 
Natl. Acad. Sci. USA 85:8047-8051 .) Visible markers, e.g., anthocyanins, green fluorescent proteins 
(GFP; Clontech), fi glucuronidase and its substrate 6-glucuronide, or luciferase and its substrate 

15 luciferin may be used. These markers can be used not only to identify transformants, but also to 

quantify the amount of transient or stable protein expression attributable to a specific vector system. 
(See, e.g., Rhodes, C,A. (1995) Methods Mol. Biol. 55:121-131.) 

Although the presence/absence of marker gene expression suggests that the gene of interest is 
also present, the presence and expression of the gene may need to be confirmed. For example, if the 

20 sequence encoding TXREG is inserted within a marker gene sequence, transformed cells containing 
sequences encoding TXREG can be identified by the absence of marker gene function. Alternatively, 
a marker gene can be placed in tandem with a sequence encoding TXREG under the control of a 
single promoter. Expression of the marker gene in response to induction or selection usually 
indicates expression of the tandem gene as well. 

25 In general, host cells that contain the nucleic acid sequence encoding TXREG and that 

express TXREG may be identified by a variety of procedures known to those of skill in the art. These 
procedures include, but are not limited to, DNA-DNA or DNA-RNA hybridizations, PCR 
amplification, and protein bioassay or immunoassay techniques which include membrane, solution, or 
chip based technologies for the detection and/or quantification of nucleic acid or protein sequences. 

30 Immunological methods for detecting and measuring the expression of TXREG using either 

specific polyclonal or monoclonal antibodies are known in the art. Examples of such techniques 
include enzyme-linked immunosorbent assays (ELISAs), radioimmunoassays (RIAs), and 
fluorescence activated cell sorting (FACS). A two-site, monoclonal-based immunoassay utilizing 
monoclonal antibodies reactive to two non-interfering epitopes on TXREG is preferred, but a 

35 competitive binding assay may be employed. These and other assays are well known in the art. (See, 
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e.g., Hampton, R. et al. (1990) Serological Methods, a Laboratory Manual , APS Press, St. Paul MN, 
Sect. IV; Coiigan, J.E. et al. (1997) Current Protocols in Immunology , Greene Pub. Associates and 
Wiley-Interscience, New York NY; and Pound, J.D. (1998) Immunochemical Protocols , Humana 
Press, TotowaNJ.) 

5 A wide variety of labels and conjugation techniques are known by those skilled in the art and 

may be used in various nucleic acid and amino acid assays. Means for producing labeled 
hybridization or PCR probes for detecting sequences related to polynucleotides encoding TXREG 
include oligolabeling, nick translation, end-labeling, or PCR amplification using a labeled nucleotide. 
Alternatively, the sequences encoding TXREG, or any fragments thereof, may be cloned into a vector 

10 for the production of an mRNA probe. Such vectors are known in the art, are commercially available, 
and may be used to synthesize RNA probes in vitro by addition of an appropriate RNA polymerase 
such as T7, T3, or SP6 and labeled nucleotides. These procedures may be conducted using a variety 
of commercially available kits, such as those provided by Amersham Pharmacia Biotech, Promega 
(Madison WI), and US Biochemical. Suitable reporter molecules or labels which may be used for 

15 ease of detection include radionuclides, enzymes, fluorescent, chemiluminescent, or chromogenic 
agents, as well as substrates, cofactors, inhibitors, magnetic particles, and the like. 

Host cells transformed with nucleotide sequences encoding TXREG may be cultured under 
conditions suitable for the expression and recovery of the protein from cell culture. The protein 
produced by a transformed cell may be secreted or retained intracellular! y depending on the sequence 

20 and/or the vector used. As will be understood by those of skill in the art, expression vectors 

containing polynucleotides which encode TXREG may be designed to contain signal sequences 
which direct secretion of TXREG through a prokaryotic or eukaryotic cell membrane. 

In addition, a host cell strain may be chosen for its ability to modulate expression of the 
inserted sequences or to process the expressed protein in the desired fashion. Such modifications of 

25 the polypeptide include, but are not limited to, acetylation, carboxylation, glycosylation, 

phosphorylation, lipidation, and acylation. Post-translational processing which cleaves a "prepro" or 
"pro" form of the protein may also be used to specify protein targeting, folding, and/or activity. 
Different host cells which have specific cellular machinery and characteristic mechanisms for 
post-translational activities (e.g., CHO, HeLa, MDCK, HEK293, and WI38) are available from the 

30 American Type Culture Collection (ATCC, Manassas VA) and may be chosen to ensure the correct 
modification and processing of the foreign protein. 

In another embodiment of the invention, natural, modified, or recombinant nucleic acid 
sequences encoding TXREG may be ligated to a heterologous sequence resulting in translation of a 
fusion protein in any of the aforementioned host systems. For example, a chimeric TXREG protein 

35 containing a heterologous moiety that can be recognized by a commercially available antibody may 

32 



WO 00/78954 



PCT/US00/16766 



facilitate the screening of peptide libraries for inhibitors of TXREG activity. Heterologous protein 
and peptide moieties may also facilitate purification of fusion proteins using commercially available 
affinity matrices. Such moieties include, but are not limited to, glutathione S-transferase (GST), 
maltose binding protein (MBP), thioredoxin (Trx), calmodulin binding peptide (CBP), 6-His, FLAG, 

5 c-myc, and hemagglutinin (HA). GST, MBP, Trx, CBP, and 6-His enable purification of their 
cognate fusion proteins on immobilized glutathione, maltose, phenylarsine oxide, calmodulin, and 
metal-chelate resins, respectively. FLAG, c-myc, and hemagglutinin (HA) enable immunoaffinity 
purification of fusion proteins using commercially available monoclonal and polyclonal antibodies 
that specifically recognize these epitope tags. A fusion protein may also be engineered to contain a 

10 proteolytic cleavage site located between the TXREG encoding sequence and the heterologous 
protein sequence, so that TXREG may be cleaved away from the heterologous moiety following 
purification. Methods for fusion protein expression and purification are discussed in Ausubel (1995, 
supra , ch. 10). A variety of commercially available kits may also be used to facilitate expression and 
purification of fusion proteins. 

15 In a further embodiment of the invention, synthesis of radiolabeled TXREG may be achieved 

in vitro using the TNT rabbit reticulocyte lysate or wheat germ extract system (Promega). These 
systems couple transcription and translation of protein-coding sequences operably associated with the 
T7, T3, or SP6 promoters. Translation takes place in the presence of a radiolabeled amino acid 
precursor, for example, 35 S-methionine. 

20 TXREG of the present invention or fragments thereof may be used to screen for compounds 

that specifically bind to TXREG. At least one and up to a plurality of test compounds may be 
screened for specific binding to TXREG. Examples of test compounds include antibodies, 
oligonucleotides, proteins (e.g., receptors), or small molecules. 

In one embodiment, the compound thus identified is closely related to the natural ligand of 

25 TXREG, e.g., a ligand or fragment thereof, a natural substrate, a structural or functional mimetic, or a 
natural binding partner. (See, Coligan, J.E. et al. (1991) Current Protocols in Immunology 1(2): 
Chapter 5.) Similarly, the compound can be closely related to the natural receptor to which TXREG 
binds, or to at least a fragment of the receptor, e.g., the ligand binding site. In either case, the 
compound can be rationally designed using known techniques. In one embodiment, screening for 

30 these compounds involves producing appropriate cells which express TXREG, either as a secreted 
protein or on the cell membrane. Preferred cells include cells from mammals, yeast, Drosophila , or 
E. coli . Cells expressing TXREG or cell membrane fractions which contain TXREG are then 
contacted with a test compound and binding, stimulation, or inhibition of activity of either TXREG or 
the compound is analyzed. 

35 An assay may simply test binding of a test compound to the polypeptide, wherein binding is 
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detected by a fluorophore, radioisotope, enzyme conjugate, or other detectable label. For example, 
the assay may comprise the steps of combining at least one test compound with TXREG, either in 
solution or affixed to a solid support, and detecting the binding of TXREG to the compound. 
Alternatively, the assay may detect or measure binding of a test compound in the presence of a 
5 labeled competitor. Additionally, the assay may be carried out using cell-free preparations, chemical 
libraries, or natural product mixtures, and the test compound(s) may be free in solution or affixed to a 
solid support. v 

TXREG of the present invention or fragments thereof may be used to screen for compounds 
that modulate the activity of TXREG. Such compounds may include agonists, antagonists, or partial 

10 or inverse agonists. In one embodiment, an assay is performed under conditions permissive for 

TXREG activity, wherein TXREG is combined with at least one test compound, and the activity of 
TXREG in the presence of a test compound is compared with the activity of TXREG in the absence 
of the test compound. A change in the activity of TXREG in the presence of the test compound is 
indicative of a compound that modulates the activity of TXREG. Alternatively, a test compound is 

15 combined with an in vitro or cell-free system comprising TXREG under conditions suitable for 
TXREG activity, and the assay is performed. In either of these assays, a test compound which 
modulates the activity of TXREG may do so indirectly and need not come in direct contact with the 
test compound. At least one and up to a plurality of test compounds may be screened. 

In another embodiment, polynucleotides encoding TXREG or their mammalian homologs 

20 may be "knocked out" in an animal model system using homologous recombination in embryonic 

stem (ES) cells. Such techniques are well known in the art and are useful for the generation of animal 
models of human disease. (See, e.g., U.S. Patent No. 5,175,383 and U.S. Patent No. 5,767,337.) For 
example, mouse ES cells, such as the mouse 129/SvJ cell line, are derived from the early mouse 
embryo and grown in culture. The ES cells are transformed with a vector containing the gene of 

25 interest disrupted by a marker gene, e.g., the neomycin phosphotransferase gene (neo; Capecchi, 

M.R. (1989) Science 244:1288-1292). The vector integrates into the corresponding region of the host 
genome by homologous recombination. Alternatively, homologous recombination takes place using 
the Cre-loxP system to knockout a gene of interest in a tissue- or developmental stage-specific 
manner (Marth, J.D. (1996) Clin. Invest. 97:1999-2002; Wagner, K.U. et al. (1997) Nucleic Acids 

30 Res. 25:4323-4330). Transformed ES cells are identified and microinjected into mouse cell 

blastocysts such as those from the C57BL/6 mouse strain. The blastocysts are surgically transferred 
to pseudopregnant dams, and the resulting chimeric progeny are genotyped and bred to produce 
heterozygous or homozygous strains. Transgenic animals thus generated may be tested with potential 
therapeutic or toxic agents. 

35 Polynucleotides encoding TXREG may also be manipulated in vitro in ES cells derived from 
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human blastocysts. Human ES cells have the potential to differentiate into at least eight separate cell 
lineages including endoderm, mesoderm, and ectodermal cell types. These cell lineages differentiate 
into, for example, neural cells, hematopoietic lineages, and cardiomyocytes (Thomson, J.A. et al. 
(1998) Science 282:1145-1147). 

5 Polynucleotides encoding TXREG can also be used to create n knockin M humanized animals 

(pigs) or transgenic animals (mice or rats) to model human disease. With knockin technology, a 
region of a polynucleotide encoding TXREG is injected into animal ES cells, and the injected 
sequence integrates into the animal cell genome. Transformed cells are injected into blastulae, and 
the blastulae are implanted as described above. Transgenic progeny or inbred lines are studied and 

10 treated with potential pharmaceutical agents to obtain information on treatment of a human disease. 
Alternatively, a mammal inbred to overexpress TXREG, e.g., by secreting TXREG in its milk, may 
also serve as a convenient source of that protein (Janne, J. et al. (1998) Biotechnol. Annu. Rev. 4:55- 
74). 

THERAPEUTICS 

15 Chemical and structural similarity, e.g., in the context of sequences and motifs, exists 

between regions of TXREG and human transcriptional regulator proteins. In addition, the expression 
of TXREG is closely associated with cell proliferation and inflammation. Therefore, TXREG 
appears to play a role in cell proliferative, autoimmune/inflammatory, and developmental disorders. 
In the treatment of disorders associated with increased TXREG expression or activity, it is desirable 

20 to decrease the expression or activity of TXREG. In the treatment of disorders associated with 
decreased TXREG expression or activity, it is desirable to increase the expression or activity of 
TXREG. 

Therefore, in one embodiment, TXREG or a fragment or derivative thereof may be 
administered to a subject to treat or prevent a disorder associated with decreased expression or 

25 activity of TXREG. Examples of such disorders include, but are not limited to, a cell proliferative 

disorder, such as actinic keratosis, arteriosclerosis, atherosclerosis, bursitis, cirrhosis, hepatitis, mixed 
connective tissue disease (MCTD), myelofibrosis, paroxysmal nocturnal hemoglobinuria, 
polycythemia vera, psoriasis, primary thrombocythemia, and cancers including adenocarcinoma, 
leukemia, lymphoma, melanoma, myeloma, sarcoma, teratocarcinoma, and, in particular, cancers of 

30 the adrenal gland, bladder, bone, bone marrow, brain, breast, cervix, gall bladder, ganglia, 

gastrointestinal tract, heart, kidney, liver, lung, muscle, ovary, pancreas, parathyroid, penis, prostate, 
salivary glands, skin, spleen, testis, thymus, thyroid, and uterus; an autoimmune/inflammatory 
disorder such as acquired immunodeficiency syndrome (AIDS), Addison's disease, adult respiratory 
distress syndrome, allergies, ankylosing spondylitis, amyloidosis, anemia, asthma, atherosclerosis, 

35 autoimmune hemolytic anemia, autoimmune thyroiditis, autoimmune polyendocrinopathy- 
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candidiasis-ectodermal dystrophy (APECED), bronchitis, cholecystitis, contact dermatitis, Crohn's 
disease, atopic dermatitis, dermatomyositis, diabetes mellitus, emphysema, episodic lymphopenia 
with lymphocytotoxins, erythroblastosis fetalis, erythema nodosum, atrophic gastritis, 
glomerulonephritis, Goodpasture's syndrome, gout, Graves' disease, Hashimoto's thyroiditis, 
5 hypereosinophilia, irritable bowel syndrome, multiple sclerosis, myasthenia gravis, myocardial or 
pericardial inflammation, osteoarthritis, osteoporosis, pancreatitis, polymyositis, psoriasis, Reiter's 
syndrome, rheumatoid arthritis, scleroderma, Sjogren's syndrome, systemic anaphylaxis, systemic 
lupus erythematosus, systemic sclerosis, thrombocytopenic purpura, ulcerative colitis, uveitis, 
Werner syndrome, complications of cancer, hemodialysis, and extracorporeal circulation, viral, 

10 bacterial, fungal, parasitic, protozoal, and helminthic infections, and trauma; and a developmental 
disorder, such as renal tubular acidosis, anemia, Cushing's syndrome, achondroplastic dwarfism, 
Duchenne and Becker muscular dystrophy, epilepsy, gonadal dysgenesis, WAGR syndrome (Wilms' 
tumor, aniridia, genitourinary abnormalities, and mental retardation), Smith-Magenis syndrome, 
myelodysplastic syndrome, hereditary mucoepithelial dysplasia, hereditary keratodermas, hereditary 

15 neuropathies such as Charcot-Marie-Tooth disease and neurofibromatosis, hypothyroidism, 
hydrocephalus, seizure disorders such as Syndenham's chorea and cerebral palsy, spina bifida, 
anencephaly, craniorachischisis, congenital glaucoma, cataract, and sensorineural hearing loss. 

In another embodiment, a vector capable of expressing TXREG or a fragment or derivative 
thereof may be administered to a subject to treat or prevent a disorder associated with decreased 

20 expression or activity of TXREG including, but not limited to, those described above. 

In a further embodiment, a pharmaceutical composition comprising a substantially purified 
TXREG in conjunction with a suitable pharmaceutical carrier may be administered to a subject to 
treat or prevent a disorder associated with decreased expression or activity of TXREG including, but 
not limited to, those provided above. 

25 In still another embodiment, an agonist which modulates the activity of TXREG may be 

administered to a subject to treat or prevent a disorder associated with decreased expression or 
activity of TXREG including, but not limited to, those listed above. 

In a further embodiment, an antagonist of TXREG may be administered to a subject to treat 
or prevent a disorder associated with increased expression or activity of TXREG. Examples of such 

30 disorders include, but are not limited to, those cell proliferative, autoimmune/inflammatory, and 
developmental disorders described above. In one aspect, an antibody which specifically binds 
TXREG may be used directly as an antagonist or indirectly as a targeting or delivery mechanism for 
bringing a pharmaceutical agent to cells or tissues which express TXREG. 

In an additional embodiment, a vector expressing the complement of the polynucleotide 

35 encoding TXREG may be administered to a subject to treat or prevent a disorder associated with 
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increased expression or activity of TXREG including, but not limited to, those described above. 

In other embodiments, any of the proteins, antagonists, antibodies, agonists, complementary 
sequences, or vectors of the invention may be administered in combination with other appropriate 
therapeutic agents. Selection of the appropriate agents for use in combination therapy may be made 
5 by one of ordinary skill in the art, according to conventional pharmaceutical principles. The 

combination of therapeutic agents may act synergistically to effect the treatment or prevention of the 
various disorders described above. Using this approach, one may be able to achieve therapeutic 
efficacy with lower dosages of each agent, thus reducing the potential for adverse side effects. 

An antagonist of TXREG may be produced using methods which are generally known in the 
10 art. In particular, purified TXREG may be used to produce antibodies or to screen libraries of 

pharmaceutical agents to identify those which specifically bind TXREG. Antibodies to TXREG may 
also be generated using methods that are well known in the art. Such antibodies may include, but are 
not limited to, polyclonal, monoclonal, chimeric, and single chain antibodies, Fab fragments, and 
fragments produced by a Fab expression library. Neutralizing antibodies (i.e., those which inhibit 
15 dimer formation) are generally preferred for therapeutic use. 

For the production of antibodies, various hosts including goats, rabbits, rats, mice, humans, 
and others may be immunized by injection with TXREG or with any fragment or oligopeptide thereof 
which has immunogenic properties. Depending on the host species, various adjuvants may be used to 
increase immunological response. Such adjuvants include, but are not limited to, Freund's, mineral 
20 gels such as aluminum hydroxide, and surface active substances such as lysolecithin, pluronic 
polyols, polyanions, peptides, oil emulsions, KLH, and dinitrophenol. Among adjuvants used in 
humans, BCG (bacilli Calmette-Guerin) and Corvnebacterium parvum are especially preferable. 

It is preferred that the oligopeptides, peptides, or fragments used to induce antibodies to 
TXREG have an amino acid sequence consisting of at least about 5 amino acids, and generally will 
25 consist of at least about 10 amino acids. It is also preferable that these oligopeptides, peptides, or 

fragments are identical to a portion of the amino acid sequence of the natural protein. Short stretches 
of TXREG amino acids may be fused with those of another protein, such as KLH, and antibodies to 
the chimeric molecule may be produced. 

Monoclonal antibodies to TXREG may be prepared using any technique which provides for 
30 the production of antibody molecules by continuous cell lines in culture. These include, but are not 
limited to, the hybridoma technique, the human B-cell hybridoma technique, and the EBV-hybridoma 
technique. (See, e.g., Kohler, G. et al. (1975) Nature 256:495-497; Kozbor, D. et al. (1985) J. 
Immunol. Methods 81:31-42; Cote, R.J. et al. (1983) Proc. Natl. Acad, Sci. USA 80:2026-2030; and 
Cole, S.P. et al. (1984) Mol. Cell Biol. 62:109-120.) 
35 In addition, techniques developed for the production of "chimeric antibodies," such as the 
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splicing of mouse antibody genes to human antibody genes to obtain a molecule with appropriate 
antigen specificity and biological activity, can be used. (See, e.g., Morrison, S.L. et al. (1984) Proc. 
Natl. Acad. Sci. USA 81:6851-6855; Neuberger, M.S. et al. (1984) Nature 312:604-608; and Takeda, 
S. et al. (1985) Nature 314:452-454.) Alternatively, techniques described for the production of single 

5 chain antibodies may be adapted, using methods known in the art, to produce TXREG-specific single 
chain antibodies. Antibodies with related specificity, but of distinct idiotypic composition, may be 
generated by chain shuffling from random combinatorial immunoglobulin libraries. (See, e.g., 
Burton, D.R. (1991) Proc. Natl. Acad. Sci. USA 88:10134-10137.) 

Antibodies may also be produced by inducing in vivo production in the lymphocyte 

10 population or by screening immunoglobulin libraries or panels of highly specific binding reagents as 
disclosed in the literature. (See, e.g., Orlandi, R. et al. (1989) Proc. Natl. Acad. Sci. USA 
86:3833-3837; Winter, G. et al. (1991) Nature 349:293-299.) 

Antibody fragments which contain specific binding sites for TXREG may also be generated. 
For example, such fragments include, but are not limited to, F(ab T ) 2 fragments produced by pepsin 

15 digestion of the antibody molecule and Fab fragments generated by reducing the disulfide bridges of 
the F(ab T )2 fragments. Alternatively, Fab expression libraries may be constructed to allow rapid and 
easy identification of monoclonal Fab fragments with the desired specificity. (See, e.g., Huse, W.D. 
et al. (1989) Science 246:1275-1281.) 

Various immunoassays may be used for screening to identify antibodies having the desired 

20 specificity. Numerous protocols for competitive binding or immunoradiometric assays using either 
polyclonal or monoclonal antibodies with established specificities are well known in the art. Such 
immunoassays typically involve the measurement of complex formation between TXREG and its 
specific antibody. A two-site, monoclonal-based immunoassay utilizing monoclonal antibodies 
reactive to two non-interfering TXREG epitopes is generally used, but a competitive binding assay 

25 may also be employed (Pound, supra) . 

Various methods such as Scatchard analysis in conjunction with radioimmunoassay 
techniques may be used to assess the affinity of antibodies for TXREG. Affinity is expressed as an 
association constant, which is defined as the molar concentration of TXREG-antibody complex 
divided by the molar concentrations of free antigen and free antibody under equilibrium conditions. 

30 The IC, determined for a preparation of polyclonal antibodies, which are heterogeneous in their 

affinities for multiple TXREG epitopes, represents the average affinity, or avidity, of the antibodies 
for TXREG. The K> determined for a preparation of monoclonal antibodies, which are monospecific 
for a particular TXREG epitope, represents a true measure of affinity. High-affinity antibody 
preparations with ranging from about 10 9 to 10 12 L/mole are preferred for use in immunoassays in 

35 which the TXREG-antibody complex must withstand rigorous manipulations. Low-affinity antibody 
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preparations with K a ranging from about 10 6 to 10 7 L/mole are preferred for use in 
immunopurification and similar procedures which ultimately require dissociation of TXREG, 
preferably in active form, from the antibody (Catty, D. (1988) Antibodies. Volume I: A Practical 
Approach , IRL Press, Washington DC; Liddell, J.E. and A. Cryer (1991) A Practical Guide to 

5 Monoclonal Antibodies . John Wiley & Sons, New York NY). 

The titer and avidity of polyclonal antibody preparations may be further evaluated to 
determine the quality and suitability of such preparations for certain downstream applications. For 
example, a polyclonal antibody preparation containing at least 1-2 mg specific antibody/ml, 
preferably 5-10 mg specific antibody/ml, is generally employed in procedures requiring precipitation 

10 of TXREG-antibody complexes. Procedures for evaluating antibody specificity, titer, and avidity, 
and guidelines for antibody quality and usage in various applications, are generally available. (See, 
e.g., Catty, supra , and Coligan et al., su^ra.) 

In another embodiment of the invention, the polynucleotides encoding TXREG, or any 
fragment or complement thereof, may be used for therapeutic purposes. In one aspect, modifications 

15 of gene expression can be achieved by designing complementary sequences or antisense molecules 
(DNA, RNA, PNA, or modified oligonucleotides) to the coding or regulatory regions of the gene 
encoding TXREG. Such technology is well known in the art, and antisense oligonucleotides or larger 
fragments can be designed from various locations along the coding or control regions of sequences 
encoding TXREG. (See, e.g., Agrawal, S., ed. (1996) Antisense Therapeutics , Humana Press Inc., 

20 Totawa NJ.) 

In therapeutic use, any gene delivery system suitable for introduction of the antisense 
sequences into appropriate target cells can be used. Antisense sequences can be delivered 
intracellular^ in the form of an expression plasmid which, upon transcription, produces a sequence 
complementary to at least a portion of the cellular sequence encoding the target protein. (See, e.g., 

25 Slater, J.E. et al. (1998) J. Allergy Clin. Immunol. 102(3):469-475; and Scanlon, K.J. et al. (1995) 

9(1 3): 1288- 1296.) Antisense sequences can also be introduced intracellularly through the use of viral 
vectors, such as retrovirus and adeno- associated virus vectors. (See, e.g., Miller, AX>. (1990) Blood 
76:271; Ausubel, supra ; Uckert, W. and W. Walther (1994) Pharmacol. Ther. 63(3)323-347.) Other 
gene delivery mechanisms include liposome-derived systems, artificial viral envelopes, and other 

30 systems known in the art. (See, e.g., Rossi, J.J. (1995) Br. Med. Bull. 51(l):217-225; Boado, R.J. et 
' al. (1998) J. Pharm. Sci. 87(1 1):1308-1315; and Morris, M.C. et al. (1997) Nucleic Acids Res. 
25(14):2730-2736.) 

In another embodiment of the invention, polynucleotides encoding TXREG may be used for 
somatic or germline gene therapy. Gene therapy may be performed to (i) correct a genetic deficiency 
35 (e.g., in the cases of severe combined immunodeficiency (SCID)-Xl disease characterized by X- 
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linked inheritance (Cavazzana-Calvo, M. et al. (2000) Science 288:669-672), severe combined 
immunodeficiency syndrome associated with an inherited adenosine deaminase (ADA) deficiency 
(Blaese, R.M. et al. (1995) Science 270:475-480; Bordignon, C. et al. (1995) Science 270:470-475), 
cystic fibrosis (Zabner, J. et al. (1993) Cell 75:207-216; Crystal, R.G. et al. (1995) Hum. Gene 
5 Therapy 6:643-666; Crystal, R.G. et al. (1995) Hum. Gene Therapy 6:667-703), thalassamias, familial 
hypercholesterolemia, and hemophilia resulting from Factor VIII or Factor DC deficiencies (Crystal, 
R.G. (1995) Science 270:404-410; Verma, I.M. and Somia, N. (1997) Nature 389:239-242)), (ii) 
express a conditionally lethal gene product (e.g., in the case of cancers which result from unregulated 
cell proliferation), or (iii) express a protein which affords protection against intracellular parasites 

10 (e.g., against human retroviruses, such as human immunodeficiency virus (HIV) (Baltimore, D. 

(1988) Nature 335:395-396; Poeschla, E. et al. (1996) Proc. Natl. Acad. Sci. USA. 93:1 1395-1 1399), 
hepatitis B or C virus (HBV, HCV); fungal parasites, such as Candida albicans and Paracoccidioides 
brasiliensis ; and protozoan parasites such as Plasmodium falciparum and Trypanosoma cruzi) . In the 
case where a genetic deficiency in TXREG expression or regulation causes disease, the expression of 

15 TXREG from an appropriate population of transduced cells may alleviate the clinical manifestations 
caused by the genetic deficiency. 

In a further embodiment of the invention, diseases or disorders caused by deficiencies in 
TXREG are treated by constructing mammalian expression vectors encoding TXREG and introducing 
these vectors by mechanical means into TXREG-deficient cells. Mechanical transfer technologies for 

20 use with cells in vivo or ex vitro include (i) direct DNA microinjection into individual cells, (ii) 
ballistic gold particle delivery, (iii) liposome-mediated transfection, (iv) receptor-mediated gene 
transfer, and (v) the use of DNA transposons (Morgan, R.A. and W.F. Anderson (1993) Annu. Rev. 
Biochem. 62:191-217; Ivies, Z. (1997) Cell 91:501-510; Boulay, J-L. and H. Recipon (1998) Curr. 
Opin. Biotechnol. 9:445-450). 

25 Expression vectors that may be effective for the expression of TXREG include, but are not 

limited to, the PCDNA 3.1, EPITAG, PRCCMV2, PREP, PVAX vectors (Invitrogen, Carlsbad CA), 
PCMV-SCRIPT, PCMV-TAG, PEGSH/PERV (Stratagene, La Jolla CA), and PTET-OFF, 
PTET-ON, PTRE2, PTRE2-LUC, PTK-HYG (Clontech, Palo Alto CA). TXREG may be expressed 
using (i) a constitutively active promoter, (e.g., from cytomegalovirus (CMV), Rous sarcoma virus 

30 (RSV), SV40 virus, thymidine kinase (TK), or P-actin genes), (ii) an inducible promoter (e.g., the 
tetracycline-regulated promoter (Gossen, M. and H. Bujard (1992) Proc. Natl. Acad. Sci. U.S.A. 
89:5547-5551; Gossen, M. et al. (1995) Science 268:1766-1769; Rossi, F.M.V. and H.M. Blau (1998) 
Curr. Opin. Biotechnol. 9:451-456), commercially, available in the T-REX plasmid (Invitrogen)); the 
ecdysone-inducible promoter (available in the plasmids PVGRXR and PIND; Invitrogen); the 

35 FK506/rapamycin inducible promoter; or the RU486/mifepri stone inducible promoter (Rossi, F.M.V. 
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and H.M. Blau, su^ra)), or (iii) a tissue-specific promoter or the native promoter of the endogenous 
gene encoding TXREG from a normal individual. 

Commercially available liposome transformation kits (e.g., the PERFECT LIPID 
TRANSFECTION KIT, available from Invitrogen) allow one with ordinary skill in the art to deliver 

5 polynucleotides to target cells in culture and require minimal effort to optimize experimental 
parameters. In the alternative, transformation is performed using the calcium phosphate method 
(Graham, F.L. and A.J. Eb (1973) Virology 52:456-467), or by electroporation (Neumann, E. et al. 
(1982) EMBO J. 1:841-845). The introduction of DNA to primary cells requires modification of 
these standardized mammalian transfection protocols. 

10 In another embodiment of the invention, diseases or disorders caused by genetic defects with 

respect to TXREG expression are treated by constructing a retrovirus vector consisting of (i) the 
polynucleotide encoding TXREG under the control of an independent promoter or the retrovirus long 
terminal repeat (LTR) promoter, (ii) appropriate RNA packaging signals, and (iii) a Rev-responsive 
element (RRE) along with additional retrovirus c^-acting RNA sequences and coding sequences 

15 required for efficient vector propagation. Retrovirus vectors (e.g., PFB and PFBNEO) are 

commercially available (Stratagene) and are based on published data (Riviere/I. et al. (1995) Proc. 
Natl. Acad. Sci. U.S.A. 92:6733-6737), incorporated by reference herein. The vector is propagated in 
an appropriate vector producing cell line (VPCL) that expresses an envelope gene with a tropism for 
receptors on the target cells or a promiscuous envelope protein such as VSVg (Armentano, D. et al. 

20 (1987) J. Virol. 61:1647-1650; Bender, M.A. et al. (1987) J. Virol. 61:1639-1646; Adam, M.A. and 
A.D. Miller (1988) J. Virol. 62:3802-3806; Dull, T. et al. (1998) J. Virol. 72:8463-8471; Zufferey, R. 
et al. (1998) J. Virol. 72:9873-9880). U.S. Patent Number 5,910,434 to Rigg ("Method for obtaining 
retrovirus packaging cell lines producing high transducing efficiency retroviral supernatant") 
discloses a method for obtaining retrovirus packaging cell lines and is hereby incorporated by 

25 reference. Propagation of retrovirus vectors, transduction of a population of cells (e.g., CD4 + T- 

cells), and the return of transduced cells to a patient are procedures well known to persons skilled in 
the art of gene therapy and have been well documented (Ranga, U. et al. (1997) J. Virol. 71:7020- 
7029; Bauer, G. et al. (1997) Blood 89:2259-2267; Bonyhadi, M.L. (1997) J. Virol. 71:4707-4716; 
Ranga, U. et al. (1998) Proc. Natl. Acad. Sci. U.S.A. 95:1201-1206; Su, L. (1997) Blood 89:2283- 

30 2290). 

In the alternative, an adenovirus-based gene therapy delivery system is used to deliver 
polynucleotides encoding TXREG to cells which have one or more genetic abnormalities with respect 
to the expression of TXREG. The construction and packaging of adenovirus-based vectors are well 
known to those with ordinary skill in the art. Replication defective adenovirus vectors have proven to 
35 be versatile for importing genes encoding immunoregulatory proteins into intact islets in the pancreas 
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(Csete, M.E, et al. (1995) Transplantation 27:263-268). Potentially useful adenoviral vectors are 
described in U.S. Patent Number 5,707,618 to Armentano ("Adenovirus vectors for gene therapy"), 
hereby incorporated by reference. For adenoviral vectors, see also Antinozzi, P. A. et al. (1999) 
Annu. Rev. Nutr. 19:53 1-544; and Verma, LM. and N. Somia (1997) Nature 18:389:239-242, both 
5 incorporated by reference herein. 

In another alternative, a herpes-based, gene therapy delivery system is used to deliver 
polynucleotides encoding TXREG to target cells which have one or more genetic abnormalities with 
respect to the expression of TXREG. The use of herpes simplex virus (HSV)-based vectors may be 
especially valuable for introducing TXREG to cells of the central nervous system, for which HSV has 

10 a tropism. The construction and packaging of herpes-based vectors are well known to those with 

ordinary skill in the art. A replication-competent herpes simplex virus (HSV) type 1 -based vector has 
been used to deliver a reporter gene to the eyes of primates (Liu, X. et al. (1999) Exp. Eye 
Res. 169:385-395). The construction of a HSV-1 virus vector has also been disclosed in detail in U.S. 
Patent Number 5,804,413 to DeLuca (''Herpes simplex virus strains for gene transfer"), which is 

15 hereby incorporated by reference. U.S. Patent Number 5,804,413 teaches the use of recombinant 
HSV d92 which consists of a genome containing at least one exogenous gene to be transferred to a 
cell under the control of the appropriate promoter for purposes including human gene therapy. Also 
taught by this patent are the construction and use of recombinant HSV strains deleted for ICP4, ICP27 
and ICP22. For HSV vectors, see also Goins, W.F. et al. (1999) J. Virol. 73:519-532 and Xu, H. et al. 

20 (1994) Dev. Biol. 163:152-161, hereby incorporated by reference. The manipulation of cloned 
herpesvirus sequences, the generation of recombinant virus following the transfection of multiple 
plasmids containing different segments of the large herpesvirus genomes, the growth and propagation 
of herpesvirus, and the infection of cells with herpesvirus are techniques well known to those of 
ordinary skill in the art. 

25 In another alternative, an alphavirus (positive, single-stranded RNA virus) vector is used to 

deliver polynucleotides encoding TXREG to target cells. The biology of the prototypic alphavirus, 
Semliki Forest Virus (SFV), has been studied extensively and gene transfer vectors have been based 
on the SFV genome (Garoff, H. and K.-J. Li (1998) Curr. Opin. Biotech. 9:464-469). During 
alphavirus RNA replication, a subgenomic RNA is generated that normally encodes the viral capsid 

30 proteins. This subgenomic RNA replicates to higher levels than the full-length genomic RNA, 

resulting in the overproduction of capsid proteins relative to the viral proteins with enzymatic activity 
(e.g., protease and polymerase). Similarly, inserting the coding sequence for TXREG into the 
alphavirus genome in place of the capsid-coding region results in the production of a large number of 
TXREG-coding RNAs and the synthesis of high levels of TXREG in vector transduced cells. While 

35 alphavirus infection is typically associated with cell lysis within a few days, the ability to establish a 
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persistent infection in hamster normal kidney cells (BHK-21) with a variant of Sindbis virus (SIN) 
indicates that the lytic replication of alphaviruses can be altered to suit the needs of the gene therapy 
application (Dryga, S.A. et al. (1997) Virology 228:74-83). The wide host range of alphaviruses will 
allow the introduction of TXREG into a variety of cell types. The specific transduction of a subset of 
5 cells in a population may require the sorting of cells prior to transduction. The methods of 
manipulating infectious cDNA clones of alphaviruses, performing alphavirus cDNA and RNA 
transfections, and performing alphavirus infections, are well known to those with ordinary skill in the 
art. 

Oligonucleotides derived from the transcription initiation site, e.g., between about positions 

10 -10 and +10 from the start site, may also be employed to inhibit gene expression. Similarly, 

inhibition can be achieved using triple helix base-pairing methodology. Triple helix pairing is useful 
because it causes inhibition of the ability of the double helix to open sufficiently for the binding of 
polymerases, transcription factors, or regulatory molecules. Recent therapeutic advances using 
triplex DNA have been described in the literature. (See, e.g., Gee, J.E. et al. (1994) in Huber, B.E. 

15 and B.I. Carr, Molecular and Immunoloeic Approaches , Futura Publishing, Mt. Kisco NY, pp. 163- . 
177.) A complementary sequence or antisense molecule may also be designed to block translation of 
mRNA by preventing the transcript from binding to ribosomes. 

Ribozymes, enzymatic RNA molecules, may also be used to catalyze the specific cleavage of 
RNA. The mechanism of ribozyme action involves sequence-specific hybridization of the ribozyme 

20 molecule to complementary target RNA, followed by endonucleolytic cleavage. For example, 
engineered hammerhead motif ribozyme molecules may specifically and efficiently catalyze 
endonucleolytic cleavage of sequences encoding TXREG. 

Specific ribozyme cleavage sites within any potential RNA target are initially identified by 
scanning the target molecule for ribozyme cleavage sites, including the following sequences: GUA, 

25 GUU, and GUC. Once identified, short RNA sequences of between 15 and 20 ribonucleotides, 
corresponding to the region of the target gene containing the cleavage site, may be evaluated for 
secondary structural features which may render the oligonucleotide inoperable. The suitability of 
candidate targets may also be evaluated by testing accessibility to hybridization with complementary 
oligonucleotides using ribonuclease protection assays. 

30 Complementary ribonucleic acid molecules and ribozymes of the invention may be prepared 

by any method known in the art for the synthesis of nucleic acid molecules. These include techniques 
for chemically synthesizing oligonucleotides such as solid phase phosphoramidite chemical synthesis. 
Alternatively, RNA molecules may be generated by in vitro and in vivo transcription of DNA 
sequences encoding TXREG. Such DNA sequences may be incorporated into a wide variety of 

35 vectors with suitable RNA polymerase promoters such as T7 or SP6. Alternatively, these cDNA 
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constructs that synthesize complementary RNA, constitutively or inducibly, can be introduced into 
cell lines, cells, or tissues. 

RNA molecules may be modified to increase intracellular stability and half-life. Possible 
modifications include, but are not limited to, the addition of flanking sequences at the 5' and/or 3* 
5 ends of the molecule, or the use of phosphorothioate or T O-methyl rather than phosphodiesterase 
linkages within the backbone of the molecule. This concept is inherent in the production of PNAs 
and can be extended in all of these molecules by the inclusion of nontraditional bases such as inosine, 
queosine, and wybutosine, as well as acetyl-, methyl-, thio-, and similarly modified forms of adenine, 
cytidine, guanine, thymine, and uridine which are not as easily recognized by endogenous 
10 endonucleases. 

An additional embodiment of the invention encompasses a method for screening for a 
compound which is effective in altering expression of a polynucleotide encoding TXREG. 
Compounds which may be effective in altering expression of a specific polynucleotide may include, 
but are not limited to, oligonucleotides, antisense oligonucleotides, triple helix-forming 

15 oligonucleotides, transcription factors and other polypeptide transcriptional regulators, and non- 
macromolecular chemical entities which are capable of interacting with specific polynucleotide 
sequences. Effective compounds may alter polynucleotide expression by acting as either inhibitors or 
promoters of polynucleotide expression. Thus, in the treatment of disorders associated with increased 
TXREG expression or activity, a compound which specifically inhibits expression of the 

20 polynucleotide encoding TXREG may be therapeutically useful, and in the treament of disorders 
associated with decreased TXREG expression or activity, a compound which specifically promotes 
expression of the polynucleotide encoding TXREG may be therapeutically useful 

At least one, and up to a plurality, of test compounds may be screened for effectiveness in 
altering expression of a specific polynucleotide. A test compound may be obtained by any method 

25 commonly known in the art, including chemical modification of a compound known to be effective in 
altering polynucleotide expression; selection from an existing, commercially-available or proprietary 
library of naturally-occurring or non-natural chemical compounds; rational design of a compound 
based on chemical and/or structural properties of the target polynucleotide; and selection from a 
library of chemical compounds created combinatorially or randomly. A sample comprising a 

30 polynucleotide encoding TXREG is exposed to at least one test compound thus obtained. The sample 
may comprise, for example, an intact or permeabilized cell, or an in vitro cell-free or reconstituted 
biochemical system. Alterations in the expression of a polynucleotide encoding TXREG are assayed 
by any method commonly known in the art. Typically, the expression of a specific nucleotide is 
detected by hybridization with a probe having a nucleotide sequence complementary to the sequence 

35 of the polynucleotide encoding TXREG. The amount of hybridization may be quantified, thus 
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forming the basis for a comparison of the expression of the polynucleotide both with and without 
exposure to one or more test compounds. Detection of a change in the expression of a polynucleotide 
exposed to a test compound indicates that the test compound is effective in altering the expression of 
the polynucleotide. A screen for a compound effective in altering expression of a specific 

5 polynucleotide can be carried out, for example, using a Schizosaccharomvces nombe gene expression 
system (Atkins, D. et al. (1999) U.S. Patent No. 5.932,435; Arndt, G.M. et al. (2000) Nucleic Acids 
Res. 28:E15) or a human cell line such as HeLa cell (Clarke, M.L. et al. (2000) Biochem. Biophys. 
Res. Commun. 268:8-13). A particular embodiment of the present invention involves screening a 
combinatorial library of oligonucleotides (such as deoxyribonucleotides, ribonucleotides, peptide 

10 nucleic acids, and modified oligonucleotides) for antisense activity against a specific polynucleotide 
sequence (Bruice, T.W. et al. (1997) U.S. Patent No. 5,686,242; Bruice, T.W. et al. (2000) U.S. 
Patent No. 6,022,691). 

Many methods for introducing vectors into cells or tissues are available and equally suitable 
for use in vivo , in vitro , and ex vivo . For ex vivo therapy, vectors may be introduced into stem cells 
15 taken from the patient and clonally propagated for autologous transplant back into that same patient. 
Delivery by transfection, by liposome injections, or by polycationic amino polymers may be achieved 
using methods which are well known in the art. (See, e.g., Goldman, C.K. et al. (1997) Nat. 
Biotechnol. 15:462-466.) 

Any of the therapeutic methods described above may be applied to any subject in need of 
20 such therapy, including, for example, mammals such as humans, dogs, cats, cows, horses, rabbits, and 
monkeys. 

An additional embodiment of the invention relates to the administration of a pharmaceutical 
composition which generally comprises an active ingredient formulated with a pharmaceutically 
acceptable excipient. Excipients may include, for example, sugars, starches, celluloses, gums, and 
25 proteins. Various formulations are commonly known and are thoroughly discussed in the latest 
edition of Remington's Pharmaceutical Sciences (Maack Publishing, Easton PA). Such 
pharmaceutical compositions may consist of TXREG, antibodies to TXREG, and mimetics, agonists, 
antagonists, or inhibitors of TXREG. 

The pharmaceutical compositions utilized in this invention may be administered by any 
30 number of routes including, but not limited to, oral, intravenous, intramuscular, intra-arterial, 

intramedullary, intrathecal, intraventricular, pulmonary, transdermal, subcutaneous, intraperitoneal, 
intranasal, enteral, topical, sublingual, or rectal means. 

Pharmaceutical compositions for pulmonary administration may be prepared in liquid or dry 
powder form. These compositions are generally aerosolized immediately prior to inhalation by the 
35 patient. In the case of small molecules (e.g. traditional low molecular weight organic drugs), aerosol 
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delivery of fast-acting formulations is well-known in the art. In the case of macromolecules (e.g. 
larger peptides and proteins), recent developments in the field of pulmonary delivery via the alveolar 
region of the lung have enabled the practical delivery of drugs such as insulin to blood circulation 
(see, e.g., Patton, J.S. et al., U.S. Patent No. 5,997,848). Pulmonary delivery has the advantage of 
5 administration without needle injection, and obviates the need for potentially toxic penetration 
enhancers. 

Pharmaceutical compositions suitable for use in the invention include compositions wherein 
the active ingredients are contained in an effective amount to achieve the intended purpose. The 
determination of an effective dose is well within the capability of those skilled in the art. 

10 Specialized forms of pharmaceutical compositions may be prepared for direct intracellular 

delivery of macromolecules comprising TXREG or fragments thereof. For example, liposome 
preparations containing a cell-impermeable macromolecule may promote cell fusion and intracellular 
delivery of the macromolecule. Alternatively, TXREG or a fragment thereof may be joined to a short 
cationic N-terminal portion from the HIV Tat-1 protein. Fusion proteins thus generated have been 

15 found to transduce into the cells of all tissues, including the brain, in a mouse model system 
(Schwarze, S.R. et al. (1999) Science 285:1569-1572). 

For any compound, the therapeutically effective dose can be estimated initially either in cell 
culture assays, e.g., of neoplastic cells, or in animal models such as mice, rats, rabbits, dogs, 
monkeys, or pigs. An animal model may also be used to determine the appropriate concentration 

20 range and route of administration. Such information can then be used to determine useful doses and 
routes for administration in humans. 

A therapeutically effective dose refers to that amount of active ingredient, for example 
TXREG or fragments thereof, antibodies of TXREG, and agonists, antagonists or inhibitors of 
TXREG, which ameliorates the symptoms or condition. Therapeutic efficacy and toxicity may be 

25 determined by standard pharmaceutical procedures in cell cultures or with experimental animals, such 
as by calculating the ED 50 (the dose therapeutically effective in 50% of the population) or LD 50 (the 
dose lethal to 50% of the population) statistics. The dose ratio of toxic to therapeutic effects is the 
therapeutic index, which can be expressed as the LD 5O /ED 50 ratio. Pharmaceutical compositions 
which exhibit large therapeutic indices are preferred. The data obtained from cell culture assays and 

30 animal studies are used to formulate a range of dosage for human use. The dosage contained in such 
compositions is preferably within a range of circulating concentrations that includes the ED 50 with 
little or no toxicity. The dosage varies within this range depending upon the dosage form employed, 
the sensitivity of the patient, and the route of administration. 

The exact dosage will be determined by the practitioner, in light of factors related to the 

35 subject requiring treatment. Dosage and administration are adjusted to provide sufficient levels of the 



WO 00/78954 



PCT7US00/16766 



active moiety or to maintain the desired effect. Factors which may be taken into account include the 
severity of the disease state, the general health of the subject, the age, weight, and gender of the 
subject, time and frequency of administration, drug combination(s), reaction sensitivities, and 
response to therapy. Long-acting pharmaceutical compositions may be administered every 3 to 4 
5 days, every week, or biweekly depending on the half-life and clearance rate of the particular 
formulation. 

Normal dosage amounts may vary from about 0.1 y,g to 100,000 jug, up to a total dose of 
about 1 gram, depending upon the route of administration. Guidance as to particular dosages and 
methods of delivery is provided in the literature and generally available to practitioners in the art. 
10 Those skilled in the art will employ different formulations for nucleotides than for proteins or their 
inhibitors. Similarly, delivery of polynucleotides or polypeptides will be specific to particular cells, 
conditions, locations, etc. 
DIAGNOSTICS 

In another embodiment, antibodies which specifically bind TXREG may be used for the 

15 diagnosis of disorders characterized by expression of TXREG, or in assays to monitor patients being 
treated with TXREG or agonists, antagonists, or inhibitors of TXREG. Antibodies useful for 
diagnostic purposes may be prepared in the same manner as described above for therapeutics. 
Diagnostic assays for TXREG include methods which utilize the antibody and a label to detect 
TXREG in human body fluids or in extracts of cells or tissues. The antibodies may be used with or 

20 without modification, and may be labeled by covalent or non-covalent attachment of a reporter 

molecule. A wide variety of reporter molecules, several of which are described above, are known in 
the art and may be used. 

A variety of protocols for measuring TXREG, including ELISAs, RIAs, and FACS, are 
known in the art and provide a basis for diagnosing altered or abnormal levels of TXREG expression. 

25 Normal or standard values for TXREG expression are established by combining body fluids or cell 
extracts taken from normal mammalian subjects, for example, human subjects, with antibody to 
TXREG under conditions suitable for complex formation. The amount of standard complex 
formation may be quantitated by various methods, such as photometric means. Quantities of TXREG 
expressed in subject, control, and disease samples from biopsied tissues are compared with the 

30 standard values. Deviation between standard and subject values establishes the parameters for 
diagnosing disease. 

In another embodiment of the invention, the polynucleotides encoding TXREG may be used 
for diagnostic purposes. The polynucleotides which may be used include oligonucleotide sequences, 
complementary RNA and DNA molecules, and PNAs. The polynucleotides may be used to detect 
35 and quantify gene expression in biopsied tissues in which expression of TXREG may be correlated 
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with disease. The diagnostic assay may be used to determine absence, presence, and excess 
expression of TXREG, and to monitor regulation of TXREG levels during therapeutic intervention. 

In one aspect, hybridization with PCR probes which are capable of detecting polynucleotide 
sequences, including genomic sequences, encoding TXREG or closely related molecules may be used 
5 to identify nucleic acid sequences which encode TXREG. The specificity of the probe, whether it is 
made from a highly specific region, e.g., the 5* regulatory region, or from a less specific region, e.g., a 
conserved motif, and the stringency of the hybridization or amplification will determine whether the 
probe identifies only naturally occurring sequences encoding TXREG, allelic variants, or related 
sequences. 

10 Probes may also be used for the detection of related sequences, and may have at least 50% 

sequence identity to any of the TXREG encoding sequences. The hybridization probes of the subject 
invention may be DNA or RNA and may be derived from the sequence of SEQ ID NO:33-64 or from 
genomic sequences including promoters, enhancers, and introns of the TXREG gene. 

Means for producing specific hybridization probes forJDNAs encoding TXREG include the 

15 cloning of polynucleotide sequences encoding TXREG or TXREG derivatives into vectors for the 

production of mRN A probes. Such vectors are known in the art, are commercially available, and may 
be used to synthesize RNA probes in vitro by means of the addition of the appropriate RNA 
polymerases and the appropriate labeled nucleotides. Hybridization probes may be labeled by a 
variety of reporter groups, for example, by radionuclides such as 32 P or 35 S, or by enzymatic labels, 

20 such as alkaline phosphatase coupled to the probe via avidin/biotin coupling systems, and the like. 

Polynucleotide sequences encoding TXREG may be used for the diagnosis of disorders 
associated with expression of TXREG. Examples of such disorders include, but are not limited to, a 
cell proliferative disorder, such as actinic keratosis, arteriosclerosis, atherosclerosis, bursitis, 
cirrhosis, hepatitis, mixed connective tissue disease (MCTD), myelofibrosis, paroxysmal nocturnal 

25 hemoglobinuria, polycythemia vera, psoriasis, primary thrombocythemia, and cancers including 
adenocarcinoma, leukemia, lymphoma, melanoma, myeloma, sarcoma, teratocarcinoma, and, in 
particular, cancers of the adrenal gland, bladder, bone, bone marrow, brain, breast, cervix, gall 
bladder, ganglia, gastrointestinal tract, heart, kidney, liver, lung, muscle, ovary, pancreas, 
parathyroid, penis, prostate, salivary glands, skin, spleen, testis, thymus, thyroid, and uterus; an 

30 autoimmune/inflammatory disorder such as acquired immunodeficiency syndrome (AJDS), 

Addison's disease, adult respiratory distress syndrome, allergies, ankylosing spondylitis, amyloidosis, 
anemia, asthma, atherosclerosis, autoimmune hemolytic anemia, autoimmune thyroiditis, 
autoimmune polyendocrinopathy-candidiasis-ectodermal dystrophy (APECED), bronchitis, 
cholecystitis, contact dermatitis, Crohn's disease, atopic dermatitis, dermatomyositis, diabetes 

35 mellitus, emphysema, episodic lymphopenia with lymphocytotoxins, erythroblastosis fetalis, 
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erythema nodosum, atrophic gastritis, glomerulonephritis, Goodpasture's syndrome, gout, Graves' 
disease, Hashimoto's thyroiditis, hypereosinophilia, irritable bowel syndrome, multiple sclerosis, 
myasthenia gravis, myocardial or pericardial inflammation, osteoarthritis, osteoporosis, pancreatitis, 
polymyositis, psoriasis, Reiter's syndrome, rheumatoid arthritis, scleroderma, Sjogren's syndrome, 

5 systemic anaphylaxis, systemic lupus erythematosus, systemic sclerosis, thrombocytopenic purpura, 
ulcerative colitis, uveitis, Werner syndrome, complications of cancer, hemodialysis, and 
extracorporeal circulation, viral, bacterial, fungal, parasitic, protozoal, and helminthic infections, and 
trauma; and a developmental disorder, such as renal tubular acidosis, anemia, Cushing's syndrome, 
achondroplastic dwarfism, Duchenne and Becker muscular dystrophy, epilepsy, gonadal dysgenesis, 

10 WAGR syndrome (Wilms' tumor, aniridia, genitourinary abnormalities, and mental retardation), 
Smith-Magenis syndrome, myelodysplastic syndrome, hereditary mucoepithelial dysplasia, 
hereditary keratodermas, hereditary neuropathies such as Charcot-Marie-Tooth disease and 
neurofibromatosis, hypothyroidism, hydrocephalus, seizure disorders such as Syndenham's chorea 
and cerebral palsy, spina bifida, anencephaly, craniorachischisis, congenital glaucoma, cataract, and 

15 sensorineural hearing loss. The polynucleotide sequences encoding TXREG may be used in Southern 
or northern analysis, dot blot, or other membrane-based technologies; in PCR technologies; in 
dipstick, pin, and multiformat ELISA-like assays; and in microarrays utilizing fluids or tissues from 
patients to detect altered TXREG expression. Such qualitative or quantitative methods are well 
known in the art. 

20 In a particular aspect, the nucleotide sequences encoding TXREG may be useful in assays 

that detect the presence of associated disorders, particularly those mentioned above. The nucleotide 
sequences encoding TXREG may be labeled by standard methods and added to a fluid or tissue 
sample from a patient under conditions suitable for the formation of hybridization complexes. After a 
suitable incubation period, the sample is washed and the signal is quantified and compared with a 

25 standard value. If the amount of signal in the patient sample is significantly altered in comparison to 
a control sample then the presence of altered levels of nucleotide sequences encoding TXREG in the 
sample indicates the presence of the associated disorder. Such assays may also be used to evaluate 
the efficacy of a particular therapeutic treatment regimen in animal studies, in clinical trials, or to 
monitor the treatment of an individual patient. 

30 In order to provide a basis for the diagnosis of a disorder associated with expression of 

TXREG, a normal or standard profile for expression is established. This may be accomplished by 
combining body fluids or cell extracts taken from normal subjects, either animal or human, with a 
sequence, or a fragment thereof, encoding TXREG, under conditions suitable for hybridization or 
amplification. Standard hybridization may be quantified by comparing the values obtained from 

35 normal subjects with values from an experiment in which a known amount of a substantially purified 
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polynucleotide is used. Standard values obtained in this manner may be compared with values 
obtained from samples from patients who are symptomatic for a disorder. Deviation from standard 
values is used to establish the presence of a disorder. 

Once the presence of a disorder is established and a treatment protocol is initiated, 
5 hybridization assays may be repeated on a regular basis to determine if the level of expression in the 
patient begins to approximate that which is observed in the normal subject. The results obtained from 
successive assays may be used to show the efficacy of treatment over a period ranging from several 
days to months. 

With respect to cancer, the presence of an abnormal amount of transcript (either under- or 

10 overexpressed) in biopsied tissue from an individual may indicate a predisposition for the 

development of the disease, or may provide a means for detecting the disease prior to the appearance 
of actual clinical symptoms. A more definitive diagnosis of this type may allow health professionals 
to employ preventative measures or aggressive treatment earlier thereby preventing the development 
or further progression of the cancer. 

15 Additional diagnostic uses for oligonucleotides designed from the sequences encoding 

TXRJEG may involve the use of PCR. These oligomers may be chemically synthesized, generated 
enzymatically, or produced in vitro . Oligomers will preferably contain a fragment of a polynucleotide 
encoding TXREG, or a fragment of a polynucleotide complementary to the polynucleotide encoding 
TXREG, and will be employed under optimized conditions for identification of a specific gene or 

20 condition. Oligomers may also be employed under less stringent conditions for detection or 
quantification of closely related DNA or RNA sequences. 

In a particular aspect, oligonucleotide primers derived from the polynucleotide sequences 
encoding TXREG may be used to detect single nucleotide polymorphisms (SNPs). SNPs are 
substitutions, insertions and deletions that are a frequent cause of inherited or acquired genetic 

25 disease in humans. Methods of SNP detection include, but are not limited to, single-stranded 
conformation polymorphism (SSCP) and fluorescent SSCP (fSSCP) methods. In SSCP, 
oligonucleotide primers derived from the polynucleotide sequences encoding TXREG are used to 
amplify DNA using the polymerase chain reaction (PCR). The DNA may be derived, for example, 
from diseased or normal tissue, biopsy samples, bodily fluids, and the like. SNPs in the DNA cause 

30 differences in the secondary and tertiary structures of PCR products in single-stranded form, and 
these differences are detectable using gel electrophoresis in non-denaturing gels. In fSCCP, the 
oligonucleotide primers are fluorescently labeled, which allows detection of the amplimers in high- 
throughput equipment such as DNA sequencing machines. Additionally, sequence database analysis 
methods, termed in silico SNP (isSNP), are capable of identifying polymorphisms by comparing the 

35 sequence of individual overlapping DNA fragments which assemble into a common consensus 
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sequence. These computer-based methods filter out sequence variations due to laboratory preparation 
of DNA and sequencing errors using statistical models and automated analyses of DNA sequence 
chromatograms. In the alternative, SNPs may be detected and characterized by mass spectrometry 
using, for example, the high throughput MASSARRAY system (Sequenom, Inc., San Diego CA). 

5 Methods which may also be used to quantify the expression of TXREG include radiolabeling 

or biotinylating nucleotides, coamplification of a control nucleic acid, and interpolating results from 
standard curves. (See, e.g., Melby, P.C. et al. (1993) J. Immunol: Methods 159:235-244; Duplaa, C. 
et al. (1993) Anal. Biochern. 212:229-236.) The speed of quantitation of multiple samples may be 
accelerated by running the assay in a high-throughput format where the oligomer or polynucleotide of 

10 interest is presented in various dilutions and a spectrophotometry or colorimetric response gives 
rapid quantitation. 

In further embodiments, oligonucleotides or longer fragments derived from any of the 
polynucleotide sequences described herein may be used as elements on a microarray. The microarray 
can be used in transcript imaging techniques which monitor the relative expression levels of large 

15 numbers of genes simultaneously as described in Seilhamer, J J. et al., "Comparative Gene Transcript 
Analysis/' U.S. Patent No. 5,840,484, incorporated herein by reference. The microarray may also be 
used to identify genetic variants, mutations, and polymorphisms. This information may be used to 
determine gene function, to understand the genetic basis of a disorder, to diagnose a disorder, to 
monitor progression/regression of disease as a function of gene expression, and to develop and 

20 monitor the activities of therapeutic agents in the treatment of disease. In particular, this information 
may be used to develop a pharmacogenomic profile of a patient in order to select the most appropriate 
and effective treatment regimen for that patient. For example, therapeutic agents which are highly 
effective and display the fewest side effects may be selected for a patient based on his/her 
pharmacogenomic profile. 

25 In another embodiment, antibodies specific for TXREG, or TXREG or fragments thereof may 

be used as elements on a microarray. The microarray may be used to monitor or measure protein- 
protein interactions, drug-target interactions, and gene expression profiles, as described above. 

Microarrays may be prepared, used, and analyzed using methods known in the art. (See, e.g., 
Brennan, T.M. et al. (1995) U.S. Patent No. 5,474,796; Schena, M. et al. (1996) Proc. Natl. Acad. Sci. 

30 USA 93:10614-10619; Baldeschweiler et al. (1995) PCT application W095/251 1 16; Shalon, D. et al. 
(1995) PCT application WO95/35505; Heller, R.A. et al. (1997) Proc. Natl. Acad. Sci. USA 94:2150- 
2155; and Heller, M.J. et al. (1997) U.S. Patent No. 5,605,662.) Various types of microarrays are 
well known and thoroughly described in DNA Microarrays: A Practical Approach . M. Schena, ed. 
(1999) Oxford University Press, London, hereby expressly incorporated by reference. 

35 In another embodiment of the invention, nucleic acid sequences encoding TXREG may be 
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used to generate hybridization probes useful in mapping the naturally occurring genomic sequence. 
Either coding or noncoding sequences may be used, and in some instances, noncoding sequences may 
be preferable over coding sequences. For example, conservation of a coding sequence among 
members of a multi-gene family may potentially cause undesired cross hybridization during 
5 chromosomal mapping. The sequences may be mapped to a particular chromosome, to a specific 
region of a chromosome, or to artificial chromosome constructions, e.g., human artificial 
chromosomes (HACs), yeast artificial chromosomes (YACs), bacterial artificial chromosomes 
(BACs), bacterial PI constructions, or single chromosome cDNA libraries. (See, e.g., Harrington, J.J. 
et al. (1997) Nat. Genet. 15:345-355; Price, CM. (1993) Blood Rev. 7:127-134; and Trask, B J. 

10 (1991) Trends Genet. 7:149-154.) Once mapped, the nucleic acid sequences of the invention may be 
used to develop genetic linkage maps, for example, which correlate the inheritance of a disease state 
with the inheritance of a particular chromosome region or restriction fragment length polymorphism 
(RFLP). (See, e.g., Lander, E.S. and D. Botstein (1986) Proc. Natl. Acad. Sci. USA 83:7353-7357.) 
Fluorescent in situ hybridization (FISH) may be correlated with other physical and genetic 

15 map data. (See, e.g., Heinz-Ulrich, et al. (1995) in Meyers, supra , pp. 965-968.) Examples of genetic 
map data can be found in various scientific journals or at the Online Mendelian Inheritance in Man 
(OMIM) World Wide Web site. Correlation between the location of the gene encoding TXREG on a 
physical map and a specific disorder, or a predisposition to a specific disorder, may help define the 
region of DNA associated with that disorder and thus may further positional cloning efforts. 

20 In situ hybridization of chromosomal preparations and physical mapping techniques, such as 

linkage analysis using established chromosomal markers, may be used for extending genetic maps. 
Often the placement of a gene on the chromosome of another mammalian species, such as mouse, 
may reveal associated markers even if the exact chromosomal locus is not known. This information is 
valuable to investigators searching for disease genes using positional cloning or other gene discovery 

25 techniques. Once the gene or genes responsible for a disease or syndrome have been crudely 

localized by genetic linkage to a particular genomic region, e.g., ataxia-telangiectasia to 1 lq22-23, 
any sequences mapping to that area may represent associated or regulatory genes for further 
investigation. (See, e.g., Gatti, R.A. et al. (1988) Nature 336:577-580.) The nucleotide sequence of 
the instant invention may also be used to detect differences in the chromosomal location due to 

30 translocation, inversion, etc., among normal, carrier, or affected individuals. 

In another embodiment of the invention, TXREG, its catalytic or immunogenic fragments, or 
oligopeptides thereof can be used for screening libraries of compounds in any of a variety of drug 
screening techniques. The fragment employed in such screening may be free in solution, affixed to a 
solid support, borne on a cell surface, or located intracellularly. The formation of binding complexes 

35 between TXREG and the agent being tested may be measured. 
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Another technique for drug screening provides for high throughput screening of compounds 
having suitable binding affinity to the protein of interest. (See, e.g., Geysen, et aL (1984) PCT 
application WO84/03564.) In this method, large numbers of different small test compounds are 
synthesized on a solid substrate. The test compounds are reacted with TXREG, or fragments thereof, 
5 and washed. Bound TXREG is then detected by methods well known in the art. Purified TXREG 
can also be coated directly onto plates for use in the aforementioned drug screening techniques. 
Alternatively, non-neutralizing antibodies can be used to capture the peptide and immobilize it on a 
solid support. 

In another embodiment, one may use competitive drug screening assays in which neutralizing 
10 antibodies capable of binding TXREG specifically compete with a test compound for binding 

TXREG. In this manner, antibodies can be used to detect the presence of any peptide which shares 
one or more antigenic determinants with TXREG. 

In additional embodiments, the nucleotide sequences which encode TXREG may be used in 
any molecular biology techniques that have yet to be developed, provided the new techniques rely on 
15 properties of nucleotide sequences that are currently known, including, but not limited to, such 
properties as the triplet genetic code and specific base pair interactions. 

Without further elaboration, it is believed that one skilled in the art can, using the preceding 
description, utilize the present invention to its fullest extent. The following preferred specific 
— \ embodiments are, therefore, to be construed as merely illustrative, and not limitative of the remainder 
20 of the disclosure in any way whatsoever. 

The disclosures of all patents, applications, and publications mentioned above and below, in 
particular U.S. Ser. No.60/140,109 are hereby expressly incorporated by reference. 

EXAMPLES 

25 I, Construction of cDNA Libraries 

RNA was purchased from Clontech or isolated from tissues described in Table 4. Some 
tissues were homogenized and lysed in guanidinium isothiocyanate, while others were homogenized 
and lysed in phenol or in a suitable mixture of denaturants, such as TRIZOL (Life Technologies), a 
monophasic solution of phenol and guanidine isothiocyanate. The resulting lysates were centrifuged 

30 over CsCl cushions or extracted with chloroform. RNA was precipitated from the lysates with either 
tsopropanol or sodium acetate and ethanol, or by other routine methods. 

Phenol extraction and precipitation of RNA were repeated as necessary to increase RNA 
purity. In some cases, RNA was treated with DNase. For most libraries, poly(A+) RNA was isolated 
using oligo d(T)-coupled paramagnetic particles (Promega), OLIGOTEX latex particles (Q1AGEN, 

35 Chatsworth CA), or an OLIGOTEX mRNA purification kit (QIAGEN). Alternatively, RNA was 
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isolated directly from tissue lysates using other RNA isolation kits, e.g., the POLY(A)PURE mRNA 
purification kit (Ambion, Austin TX). 

In some cases, Stratagene was provided with RNA and constructed the corresponding cDNA 
libraries. Otherwise, cDNA was synthesized and cDNA libraries were constructed with the UNIZAP 
5 vector system (Stratagene) or SUPERSCRIPT plasmid system (Life Technologies), using the 

recommended procedures or similar methods known in the art. (See, e.g., Ausubel, 1997, supra , units 
5.1-6.6.) Reverse transcription was initiated using oligo d(T) or random primers. Synthetic 
oligonucleotide adapters were ligated to double stranded cDNA, and the cDNA was digested with the 
appropriate restriction enzyme or enzymes. For most libraries, the cDNA was size-selected (300- 

10 1000 bp) using SEPHACRYL S 1000, SEPHAROSE CL2B, or SEPHAROSE CL4B column 

chromatography (Amersham Pharmacia Biotech) or preparative agarose gel electrophoresis. cDNAs 
were ligated into compatible restriction enzyme sites of the polylinker of a suitable plasmid, e.g., 
PBLUESCRIPT plasmid (Stratagene), PSPORT1 plasmid (Life Technologies), pcDNA2.1 plasmid 
(Invitrogen, Carlsbad CA), or pINCY plasmid (Incyte Genomics, Palo Alto CA). Recombinant 

15 plasmids were transformed into competent E. coli cells including XLl-Blue, XLl-BlueMRF, or 
SOLR from Stratagene or DH5a, DH10B, or ElectroMAX DH10B from Life Technologies. 
II. Isolation of cDNA Clones 

Plasmids obtained as described in Example I were recovered from host cells by in vivo 
excision using the UNIZAP vector system (Stratagene) or by cell lysis. Plasmids were purified using 

20 at least one of the following: a Magic or WIZARD Minipreps DNA purification system (Promega); an 
AGTC Miniprep purification kit (Edge Biosystems, Gaithersburg MD); and QIAWELL 8 Plasmid, 
QIAWELL 8 Plus Plasmid, QIAWELL 8 Ultra Plasmid purification systems or the R.E.A.L. PREP 96 
plasmid purification kit from QIAGEN. Following precipitation, plasmids were resuspended in 0.1 
ml of distilled water and stored, with or without lyophilization, at 4°C. 

25 Alternatively, plasmid DNA was amplified from host cell lysates using direct link PGR in a 

high-throughput format (Rao, V.B. (1994) Anal. Biochem. 216:1-14). Host cell lysis and thermal 
cycling steps were carried out in a single reaction mixture. Samples were processed and stored in 
384-well plates, and the concentration of amplified plasmid DNA was quantified fluorometrically 
using PICOGREEN dye (Molecular Probes, Eugene OR) and a FLUOROSKAN II fluorescence 

30 scanner (Labsystems Oy, Helsinki, Finland), 
in. Sequencing and Analysis 

Incyte cDNA recovered in plasmids as described in Example II were sequenced as follows. 
Sequencing reactions were processed using standard methods or high-throughput instrumentation 
such as the ABI CATALYST 800 (PE Biosystems) thermal cycler or the PTC-200 thermal cycler (MJ 

35 Research) in conjunction with the HYDRA microdispenser (Robbins Scientific) or the MICROLAB 
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2200 (Hamilton) liquid transfer system. cDNA sequencing reactions were prepared using reagents 
provided by Amersham Pharmacia Biotech or supplied in ABI sequencing kits such as the ABI 
PRISM BIGDYE Terminator cycle sequencing ready reaction kit (PE Biosystems). Electrophoretic 
separation of cDNA sequencing reactions and detection of labeled polynucleotides were carried out 

5 using the MEGABACE 1000 DNA sequencing system (Molecular Dynamics); the ABI PRISM 373 
or 377 sequencing system (PE Biosystems) in conjunction with standard ABI protocols and base 
calling software; or other sequence analysis systems known in the art. Reading frames within the 
cDNA sequences were identified using standard methods (reviewed in Ausubel, 1997, supra , unit 
7.7). Some of the cDNA sequences were selected for extension using the techniques disclosed in 

10 Example VI. 

The polynucleotide sequences derived from cDNA sequencing were assembled and analyzed 
using a combination of software programs which utilize algorithms well known to those skilled in the 
art. Table 5 summarizes the tools, programs, and algorithms used and provides applicable 
descriptions, references, and threshold parameters. The first column of Table 5 shows the tools, 

15 programs, and algorithms used, the second column provides brief descriptions thereof, the third 
column presents appropriate references, all of which are incorporated by reference herein in their 
entirety, and the fourth column presents, where applicable, the scores, probability values, and other 
parameters used to evaluate the strength of a match between two sequences (the higher the score, the 
greater the homology between two sequences). Sequences were analyzed using MACDNASIS PRO 

20 software (Hitachi Software Engineering, South San Francisco CA) and LASERGENE software 

(DNASTAR). Polynucleotide and polypeptide sequence alignments were generated using the default 
parameters specified by the clustal algorithm as incorporated into the MEGALIGN multisequence 
alignment program (DNASTAR), which also calculates the percent identity between aligned 
sequences. 

25 The polynucleotide sequences were validated by removing vector, linker, and polyA 

sequences and by masking ambiguous bases, using algorithms and programs based on BLAST, 
dynamic programing, and dinucleotide nearest neighbor analysis. The sequences were then queried 
against a selection of public databases such as the GenBank primate, rodent, mammalian, vertebrate, 
and eukaryote databases, and BLOCKS, PRINTS, DOMO, PRODOM, and PFAM to acquire 

30 annotation using programs based on BLAST, FASTA, and BLIMPS. The sequences were assembled 
into full length polynucleotide sequences using programs based on Phred, Phrap, and Consed, and 
were screened for open reading frames using programs based on GeneMark, BLAST, and FASTA. 
The full length polynucleotide sequences were translated to derive the corresponding full length 
amino acid sequences, and these full length sequences were subsequently analyzed by querying 

35 against databases such as the GenBank databases (described above), SwissProt, BLOCKS, PRINTS, 
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DOMO, PRODOM, Prosite, and Hidden Markov Model (HMM)-based protein family databases such 
as PFAM. HMM is a probabilistic approach which analyzes consensus primary structures of gene 
families. (See, e.g., Eddy, S.R. (1996) Curr. Opin. Struct. Biol. 6:361-365.) 

The programs described above for the assembly and analysis of full length polynucleotide 
5 and amino acid sequences were also used to identify polynucleotide sequence fragments from SEQ ED 
NO:33-64. Fragments from about 20 to about 4000 nucleotides which are useful in hybridization and 
amplification technologies were described in The Invention section above. 
IV. Analysis of Polynucleotide Expression 

Northern analysis is a laboratory technique used to detect the presence of a transcript of a 
10 gene and involves the hybridization of a labeled nucleotide sequence to a membrane on which RNAs 
from a particular cell type or tissue have been bound. (See, e.g., Sambrook, supra , ch. 7; Ausubel, 
1 995, supra , ch. 4 and 1 6.) 

Analogous computer techniques applying BLAST were used to search for identical or related 
molecules in cDNA databases such as GenBank or LIFESEQ (Incyte Genomics). This analysis is 
15 much faster than multiple membrane-based hybridizations. In addition, the sensitivity of the 

computer search can be modified to determine whether any particular match is categorized as exact or 
similar. The basis of the search is the product score, which is defined as: 

BLAST Score x Percent Identity 

5 x minimum {length(Seq. 1), length(Seq. 2)} 

20 

The product score takes into account both the degree of similarity between two sequences and the 
length of the sequence match. The product score is a normalized value between 0 and 100, and is 
calculated as follows: the BLAST score is multiplied by the percent nucleotide identity and the 
product is divided by (5 times the length of the shorter of the two sequences). The BLAST score is 

25 calculated by assigning a score of +5 for every base that matches in a high-scoring segment pair 
(HSP), and -4 for every mismatch. Two sequences may share more than one HSP (separated by 
gaps). If there is more than one HSP, then the pair with the highest BLAST score is used to calculate 
the product score. The product score represents a balance between fractional overlap and quality in a 
BLAST alignment. For example, a product score of 100 is produced only for 100% identity over the 

30 entire length of the shorter of the two sequences being compared. A product score of 70 is produced 
either by 100% identity and 70% overlap at one end, or by 88% identity and 100% overlap at the 
other. A product score of 50 is produced either by 100% identity and 50% overlap at one end, or 79% 
identity and 100% overlap. 

The results of northern analyses are reported as a percentage distribution of libraries in which 

35 the transcript encoding TXREG occurred. Analysis involved the categorization of cDNA libraries by 
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organ/tissue and disease. The organ/tissue categories included cardiovascular, dermatologic, 
developmental, endocrine, gastrointestinal, hematopoietic/immune, musculoskeletal, nervous, 
reproductive, and urologic. The disease/condition categories included cancer, inflammation, trauma, 
cell proliferation, neurological, and pooled. For each category, the number of libraries expressing the 
5 sequence of interest was counted and divided by the total number of libraries across all categories. 
Percentage values of tissue-specific and disease- or condition-specific expression are reported in 
Table 3. 

V, Chromosomal Mapping of TXREG Encoding Polynucleotides 

The cDNA sequences which were used to assemble SEQ ID NO:33-64 were compared with 

10 sequences from the Incyte LIFESEQ database and public domain databases using BLAST and other 
implementations of the Smith-Waterman algorithm. Sequences from these databases that matched 
SEQ ID NO:33-64 were assembled into clusters of contiguous and overlapping sequences using 
assembly algorithms such as Phrap (Table 5). Radiation hybrid and genetic mapping data available 
from public resources such as the Stanford Human Genome Center (SHGC), Whitehead Institute for 

15 Genome Research (W1GR), and Genethon were used to determine if any of the clustered sequences 
had been previously mapped. Inclusion of a mapped sequence in a cluster resulted in the assignment 
of all sequences of that cluster, including its particular SEQ ID NO:, to that map location. 

The genetic map locations of SEQ ID NO:33, SEQ ID NO:34, SEQ ID NO:35, SEQ ID 
NO:36, SEQ ID NO:38, SEQ ID NO:40, SEQ ID NO:44, SEQ ID NO:45, SEQ ID NO:47, SEQ ID 

20 NO:61, SEQ ID NO:62, and SEQ ID NO:64 are described in The Invention as ranges, or intervals, of 
human chromosomes. More than one map location is reported for SEQ ID NO:33, SEQ ID NO:36, 
and SEQ ID NO:64, indicating that previously mapped sequences having similarity, but not complete 
identity, to SEQ ID NO:33, SEQ ID NO:36, and SEQ ID NO:64 were assembled into their respective . 
clusters. The map position of an interval, in centiMorgans, is measured relative to the terminus of the 

25 chromosome's p-arm. (The centiMorgan (cM) is a unit of measurement based on recombination 

frequencies between chromosomal markers. On average, 1 cM is roughly equivalent to 1 megabase 
(Mb) of DNA in humans, although this can vary widely due to hot and cold spots of recombination.) 
The cM distances are based on genetic markers mapped by Genethon which provide boundaries for 
radiation hybrid markers whose sequences were included in each of the clusters. Diseases associated 

30 with the public and Incyte sequences located within the indicated intervals are also reported in the 
Invention where applicable. 

VI. Extension of TXREG Encoding Polynucleotides 

The full length nucleic acid sequences of SEQ ID NO:33-64 were produced by extension of 
an appropriate fragment of the full length molecule using oligonucleotide primers designed from this 
35 fragment. One primer was synthesized to initiate 5' extension of the known fragment, and the other 
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primer, to initiate 3' extension of the known fragment. The initial primers were designed using 
OLIGO 4.06 software (National Biosciences), or another appropriate program, to be about 22 to 30 
nucleotides in length, to have a GC content of about 50% or more, and to anneal to the target 
sequence at temperatures of about 68°C to about 72°C. Any stretch of nucleotides which would 
5 result in hairpin structures and primer-primer dimerizations was avoided. 

Selected human cDNA libraries were used to extend the sequence. If more than one 
extension was necessary or desired, additional or nested sets of primers were designed. 

High fidelity amplification was obtained by PCR using methods well known in the art. PCR 
was performed in 96-well plates using the PTC-200 thermal cycler (MJ Research, Inc.). The reaction 

10 mix contained DNA template, 200 nmol of each primer, reaction buffer containing Mg 2 *, (NH 4 ) 2 S0 4 , 
and p-mercaptoethanol, Taq DNA polymerase (Amersham Pharmacia Biotech), ELONGASE enzyme 
(Life Technologies), and Pfu DNA polymerase (Stratagene), with the following parameters for primer 
pair PCI A and PCI B: Step 1: 94°C, 3 min; Step 2: 94°C, 15 sec; Step 3: 60°C, 1 min; Step 4: 68°C, 
2 min; Step 5: Steps 2, 3, and 4 repeated 20 times; Step 6: 68°C, 5 min; Step 7: storage at 4°C. In the 

15 alternative, the parameters for primer pair T7 and SK+ were as follows: Step 1 : 94 °C, 3 min; Step 2: 
94 °C, 15 sec; Step 3: 57 °C, 1 min; Step 4: 68 °C, 2 min; Step 5: Steps 2, 3, and 4 repeated 20 times; 
Step 6: 68 °C, 5 min; Step 7: storage at 4°C. 

The concentration of DNA in each well was determined by dispensing 100 jil PICOGREEN 
quantitation reagent (0.25% (v/v) PICOGREEN; Molecular Probes, Eugene OR) dissolved in IX TE 

20 and 0.5 ^1 of undiluted PCR product into each well of an opaque fluorimeter plate (Corning Costar, 
Acton MA), allowing the DNA to bind to the reagent. The plate was scanned in a Fluoroskan II 
(Labsystems Oy, Helsinki, Finland) to measure the fluorescence of the sample and to quantify the 
concentration of DNA. A 5 ju\ to 10 {A aliquot of the reaction mixture was analyzed by 
electrophoresis on a 1 % agarose mini-gel to determine which reactions were successful in extending 

25 the sequence. 

The extended nucleotides were desalted and concentrated, transferred to 384-well plates, 
digested with CviJI cholera virus endonuclease (Molecular Biology Research, Madison WI), and 
sonicated or sheared prior to religation into pUC 18 vector (Amersham Pharmacia Biotech). For 
shotgun sequencing, the digested nucleotides were separated on low concentration (0.6 to 0.8%) 

30 agarose gels, fragments were excised, and agar digested with Agar ACE (Promega). Extended clones 
were religated using T4 ligase (New England Biolabs, Beverly MA) into pUC 18 vector (Amersham 
Pharmacia Biotech), treated with Pfu DNA polymerase (Stratagene) to fill-in restriction site 
overhangs, and transfected into competent E. coli cells. Transformed cells were selected on 
antibiotic-containing media, and individual colonies were picked and cultured overnight at 37 °C in 

35 384-well plates in LB/2x carb liquid media. 
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The cells were lysed, and DNA was amplified by PCR using Taq DNA polymerase 
(Amersham Pharmacia Biotech) and Pfu DNA polymerase (Stratagene) with the following 
parameters: Step 1: 94°C, 3 min; Step 2: 94°C, 15 sec; Step 3: 60°C, 1 min; Step 4: 72°C, 2 min; 
Step 5: steps 2, 3, and 4 repeated 29 times; Step 6: 72°C, 5 min; Step 7: storage at 4°C. DNA was 
5 quantified by PICOGREEN reagent (Molecular Probes) as described above. Samples with low DNA 
recoveries were reamplified using the same conditions as described above. Samples were diluted 
with 20% dimethysulfoxide (1 :2, v/v), and sequenced using DYENAMIC energy transfer sequencing 
primers and the DYENAMIC DIRECT kit (Amersham Pharmacia Biotech) or the ABI PRISM 
BIGDYE Terminator cycle sequencing ready reaction kit (PE Biosystems). 
10 In like manner, the polynucleotide sequences of SEQ ID NO:33-64 are used to obtain 5' 

regulatory sequences using the procedure above, along with oligonucleotides designed for such 
extension, and an appropriate genomic library. 
VII. Labeling and Use of Individual Hybridization Probes 

Hybridization probes derived from SEQ ID NO:33-64 are employed to screen cDNAs, 
15 genomic DNAs, or mRNAs. Although the labeling of oligonucleotides, consisting of about 20 base 
pairs, is specifically described, essentially the same procedure is used with larger nucleotide 
fragments. Oligonucleotides are designed using state-of-the-art software such as OLIGO 4.06 
software (National Biosciences) and labeled by combining 50 pmol of each oligomer, 250 j^Ci of 
[y- 32 P] adenosine triphosphate (Amersham Pharmacia Biotech), and T4 polynucleotide kinase 
20 (DuPont NEN, Boston MA). The labeled oligonucleotides are substantially purified using a 

SEPHADEX G-25 superfine size exclusion dextran bead column (Amersham Pharmacia Biotech). 
An aliquot containing 10 7 counts per minute of the labeled probe is used in a typical membrane-based 
hybridization analysis of human genomic DNA digested with one of the following endonucleases: 
Ase I, Bgl H, Eco RI, Pst I, Xba I, or Pvu H (DuPont NEN). 
25 The DNA from each digest is fractionated on a 0.7% agarose gel and transferred to nylon 

membranes (Nytran Plus, Schleicher & Schuell, Durham NH). Hybridization is carried out for 16 
hours at 40 °C. To remove nonspecific signals, blots are sequentially washed at room temperature 
under conditions of up to, for example, 0.1 x saline sodium citrate and 0.5% sodium dodecyl sulfate. 
Hybridization patterns are visualized using autoradiography or an alternative imaging means and 
30 compared. 

VIII. Microarrays 

The linkage or synthesis of array elements upon a microarray can be achieved utilizing 
photolithography, piezoelectric printing (ink-jet printing, See, e.g., Baldeschweiler, supra ), 
mechanical microspotting technologies, and derivatives thereof. The substrate in each of the 
35 aforementioned technologies should be uniform and solid with a non-porous surface (Schena (1999), 
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supra ). Suggested substrates include silicon, silica, glass slides, glass chips, and silicon wafers. 
Alternatively, a procedure analogous to a dot or slot blot may also be used to arrange and link 
elements to the surface of a substrate using thermal, UV, chemical, or mechanical bonding 
procedures. A typical array may be produced using available methods and machines well known to 
5 those of ordinary skill in the art and may contain any appropriate number of elements. (See, e.g., 
Schena, M. et al. (1995) Science 270:467-470; Shalon, D. et al. (1996) Genome Res. 6:639-645; 
Marshall, A. and J. Hodgson (1998) Nat. Biotechnol. 16:27-31.) 

Full length cDNAs, Expressed Sequence Tags (ESTs), or fragments or oligomers thereof 
may comprise the elements of the microarray. Fragments or oligomers suitable for hybridization can 

10 be selected using software well known in the art such as LASERGENE software (DNASTAR). The 
array elements are hybridized with polynucleotides in a biological sample. The polynucleotides in the 
biological sample are conjugated to a fluorescent label or other molecular tag for ease of detection. 
After hybridization, nonhybridized nucleotides from the biological sample are removed, and a 
fluorescence scanner is used to detect hybridization at each array element. Alternatively, laser 

15 desorbtion and mass spectrometry may be used for detection of hybridization. The degree of 

complementarity and the relative abundance of each polynucleotide which hybridizes to an element 
on the microarray may be assessed. In one embodiment, microarray preparation and usage is 
described in detail below. 
Tissue or Cell Sample Preparation 

20 Total RNA is isolated from tissue samples using the guanidinium thiocyanate method and 

poly(A) + RNA is purified using the oligo-(dT) cellulose method. Each poly(A) + RNA sample is 
reverse transcribed using MMLV reverse-transcriptase, 0.05 pg/^il oligo-(dT) primer (21mer), IX 
first strand buffer, 0.03 units/|il RNase inhibitor, 500 dATP, 500 dGTP, 500 jiM dTTP, 40 
MM dCTP, 40 mM dCTP-Cy3 (BDS) or dCTP-Cy5 (Amersham Pharmacia Biotech). The reverse 

25 transcription reaction is performed in a 25 ml volume containing 200 ng poly(A) + RNA with 

GEMB RIGHT kits (Incyte). Specific control poly(A) + RNAs are synthesized by in vitro transcription 
from non-coding yeast genomic DNA. After incubation at 37 °C for 2 hr, each reaction sample (one 
with Cy3 and another with Cy5 labeling) is treated with 2.5 ml of 0.5M sodium hydroxide and 
incubated for 20 minutes at 85 °C to the stop the reaction and degrade the RNA. Samples are purified 

30 using two successive CHROMA SPIN 30 gel filtration spin columns (CLONTECH Laboratories, Inc. 
(CLONTECH), Palo Alto CA) and after combining, both reaction samples are ethanol precipitated 
using 1 ml of glycogen (1 mg/ml), 60 ml sodium acetate, and 300 ml of 100% ethanol. The sample is 
then dried to completion using a SpeedVAC (Savant Instruments Inc., Holbrook NY) and 
resuspended in 14 fil 5X SSC/0.2% SDS. 

35 Microarray Preparation 
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Sequences of the present invention are used to generate array elements. Each array element 
is amplified from bacterial cells containing vectors with cloned cDNA inserts. PCR amplification 
uses primers complementary to the vector sequences flanking the cDNA insert. Array elements are 
amplified in thirty cycles of PCR from an initial quantity of 1-2 ng to a final quantity greater than 5 
5 |ig. Amplified array elements are then purified using SEPHACRYL-400 (Amersham Pharmacia 
Biotech). 

Purified array elements are immobilized on polymer-coated glass slides. Glass microscope 
slides (Corning) are cleaned by ultrasound in 0.1% SDS and acetone, with extensive distilled water 
washes between and after treatments. Glass slides are etched in 4% hydrofluoric acid (VWR 
10 Scientific Products Corporation (VWR), West Chester PA), washed extensively in distilled water, 
and coated with 0.05% aminopropyl silane (Sigma) in 95% ethanol. Coated slides are cured in a 
110°C oven. 

Array elements are applied to the CQated glass substrate using a procedure described in US 

Patent No. 5,807,522, incorporated herein by reference. 1 p\ of the array element DNA, at an average 
15 concentration of 100 ng/(il, is loaded into the open capillary printing element by a high-speed robotic 

apparatus. The apparatus then deposits about 5 nl of array element sample per slide. 

Microarrays are UV-crosslinked using a STRATALINKER UV-crosslinker (Stratagene). 

Microarrays are washed at room temperature once in 0.2% SDS and three times in distilled water. 

Non-specific binding sites are blocked by incubation of microarrays in 0.2% casein in phosphate 
20 buffered saline (PBS) (Tropix, Inc., Bedford MA) for 30 minutes at 60 °C followed by washes in 

0.2% SDS and distilled water as before. 

Hybridization 

Hybridization reactions contain 9 |il of sample mixture consisting of 0.2 each of Cy3 and 
Cy5 labeled cDNA synthesis products in 5X SSC, 0.2% SDS hybridization buffer. The sample 

25 mixture is heated to 65 °C for 5 minutes and is aliquoted onto the microarray surface and covered 
with an 1.8 cm 2 coverslip. The arrays are transferred to a waterproof chamber having a cavity just 
slightly larger than a microscope slide. The chamber is kept at 100% humidity internally by the 
addition of 140 jil of 5X SSC in a corner of the chamber. The chamber containing the arrays is 
incubated for about 6.5 hours at 60 °C. The arrays are washed for 10 min at 45 °C in a first wash 

30 buffer (IX SSC, 0.1% SDS), three times for 10 minutes each at 45 °C in a second wash buffer (0.1X 
SSC), and dried. 
Detection 

Reporter-labeled hybridization complexes are detected with a microscope equipped with an 
Innova 70 mixed gas 10 W laser (Coherent, Inc., Santa Clara CA) capable of generating spectral lines 
35 at 488 nm for excitation of Cy3 and at 632 nm for excitation of Cy5. The excitation laser light is 
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focused on the array using a 20X microscope objective (Nikon, Inc., Melville NY). The slide \ 
containing the array is placed on a computer-controlled X-Y stage on the microscope and raster- 
scanned past the objective. The 1.8 cm x 1.8 cm array used in the present example is scanned with a 
resolution of 20 micrometers. 
5 In two separate scans, a mixed gas multiline laser excites the two fluorophores sequentially. 

Emitted light is split, based on wavelength, into two photomultiplier tube detectors (PMT R1477, 
Hamamatsu Photonics Systems, Bridgewater NJ) corresponding to the two fluorophores. Appropriate 
filters positioned between the array and the photomultiplier tubes are used to filter the signals. The 
emission maxima of the fluorophores used are 565 nm for Cy3 and 650 nm for Cy5. Each array is 

10 typically scanned twice, one scan per fluorophore using the appropriate filters at the laser source, 
although the apparatus is capable of recording the spectra from both fluorophores simultaneously. 

The sensitivity of the scans is typically calibrated using the signal intensity generated by a 
cDNA control species added to the sample mixture at a known concentration. A specific location on 
the array contains a complementary DNA sequence, allowing the intensity of the signal at that 

15 location to be correlated with a weight ratio of hybridizing species of 1:100,000. When two samples 
from different sources (e.g., representing test and control cells), each labeled with a different 
fluorophore, are hybridized to a single array for the purpose of identifying genes that are 
differentially expressed, the calibration is done by labeling samples of the calibrating cDNA with the 
two fluorophores and adding identical amounts of each to the hybridization mixture. 

20 The output of the photomultiplier tube is digitized using a 12-bit RTI-835H analog-to-digital 

(A/D) conversion board (Analog Devices, Inc., Norwood MA) installed in an IBM-compatible PC 
computer. The digitized data are displayed as an image where the signal intensity is mapped using a 
linear 20-color transformation to a pseudocolor scale ranging from blue (low signal) to red (high 
signal). The data is also analyzed quantitatively. Where two different fluorophores are excited and 

25 measured simultaneously, the data are first corrected for optical crosstalk (due to overlapping 
emission spectra) between the fluorophores using each fluorophore' s emission spectrum. 

A grid is superimposed over the fluorescence signal image such that the signal from each 
spot is centered in each element of the grid. The fluorescence signal within each element is then 
integrated to obtain a numerical value corresponding to the average intensity of the signal. The 

30 software used for signal analysis is the GEMTOOLS gene expression analysis program (Incyte). 
IX. Complementary Polynucleotides 

Sequences complementary to the TXREG-encoding sequences, or any parts thereof, are used 
to detect, decrease, or inhibit expression of naturally occurring TXREG. Although use of 
oligonucleotides comprising from about 15 to 30 base pairs is described, essentially the same 

35 procedure is used with smaller or with larger sequence fragments. Appropriate oligonucleotides are 
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designed using OLIGO 4.06 software (National Biosciences) and the coding sequence of TXREG. 
To inhibit transcription, a complementary oligonucleotide is designed from the most unique 5' 
sequence and used to prevent promoter binding to the coding sequence. To inhibit translation, a 
complementary oligonucleotide is designed to prevent ribosomal binding to the TXREG-encoding 

5 transcript. 

X. Expression of TXREG 

Expression and purification of TXREG is achieved using bacterial or virus-based expression 
systems. For expression of TXREG in bacteria, cDNA is subcloned into an appropriate vector 
containing an antibiotic resistance gene and an inducible promoter that directs high levels of cDNA 

10 transcription. Examples of such promoters include, but are not limited to, the trp-lac (tac) hybrid 
promoter and the T5 or T7 bacteriophage promoter in conjunction with the lac operator regulatory 
element. Recombinant vectors are transformed into suitable bacterial hosts, e.g., BL21(DE3). 
Antibiotic resistant bacteria express TXREG upon induction with isopropyl beta-D- 
thiogalactopyranoside (IPTG). Expression of TXREG in eukaryotic cells is achieved by infecting 

15 insect or mammalian cell lines with recombinant Autographica californica nuclear polyhedrosis virus 
(AcMNPV), commonly known as baculovirus. The nonessential polyhedrin gene of baculovirus is 
replaced with cDNA encoding TXREG by either homologous recombination or bacterial-mediated 
transposition involving transfer plasmid intermediates. Viral infectivity is maintained and the strong 
polyhedrin promoter drives high levels of cDNA transcription. Recombinant baculovirus is used to 

20 infect Spodoptera frusiperda (Sf9) insect cells in most cases, or human hepatocytes, in some cases. 
Infection of the latter requires additional genetic modifications to baculovirus. (See Engelhard, E.K. 
et al. (1994) Proc. Natl. Acad. Sci. USA 91:3224-3227; Sandig, V. et al. (1996) Hum. Gene Ther. 
7:1937-1945.) 

In most expression systems, TXREG is synthesized as a fusion protein with, e.g., glutathione 
25 S-transferase (GST) or a peptide epitope tag, such as FLAG or 6-His, permitting rapid, single-step, 
affinity-based purification of recombinant fusion protein from crude cell lysates. GST, a 26- 
kilodalton enzyme from Schistosoma iaponicum . enables the purification of fusion proteins on 
immobilized glutathione under conditions that maintain protein activity and antigenicity (Amersham 
Pharmacia Biotech). Following purification, the GST moiety can be proteolytically cleaved from 
30 TXREG at specifically engineered sites. FLAG, an 8-amino acid peptide, enables immunoaffinity 

purification using commercially available monoclonal and polyclonal anti-FLAG antibodies (Eastman 
Kodak). 6-His, a stretch of six consecutive histidine residues, enables purification on metal-chelate 
resins (QIAGEN). Methods for protein expression and purification are discussed in Ausubel (1995, 
supra , ch. 10 and 16). Purified TXREG obtained by these methods can be used directly in the assays 
35 shown in Examples XI and XV. 
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XL Demonstration of TXREG Activity 

TXREG activity is measured by its ability to stimulate transcription of a reporter gene (Liu, 
H.Y. et al. (1997) EMBO J. 16(17):5289-5298). The assay employs a well characterized reporter 
gene construct, LexA^-LacZ, that consists of LexA DNA transcriptional control elements (LexA op ) 
5 fused to sequences encoding the E. coli LacZ enzyme. The methods for constructing and expressing 
fusions genes, introducing them into cells, and measuring LacZ enzyme activity, are well known to 
those skilled in the art. Sequences encoding TXREG are cloned into a plasmid that directs the 
synthesis of a fusion protein, LexA-TXREG, consisting of TXREG and a DNA binding domain 
derived from the LexA transcription factor. The resulting plasmid, encoding a LexA-TXREG fusion 

10 protein, is introduced into yeast cells along with a plasmid containing the LexA op -LacZ reporter gene. 
The amount of LacZ enzyme activity associated with LexA-TXREG transfected cells, relative to 
control cells, is proportional to the amount of transcription stimulated by the TXREG. 
XII. Functional Assays 

TXREG function is assessed by expressing the sequences encoding TXREG at 

15 physiologically elevated levels in mammalian cell culture systems. cDNA is subcloned into a 
mammalian expression vector containing a strong promoter that drives high levels of cDNA 
expression. Vectors of choice include pCMV SPORT plasmid (Life Technologies) and pCR3.1 
plasmid (Invitrogen), both of which contain the cytomegalovirus promoter. 5-10 yUg of recombinant 
vector are transiently transfected into a human cell line, for example, an endothelial or hematopoietic 

20 cell line, using either liposome formulations or electroporation. 1-2 yug of an additional plasmid 

containing sequences encoding a marker protein are co-transfected. Expression of a marker protein 
provides a means to distinguish transfected cells from nontransfected cells and is a reliable predictor 
of cDNA expression from the recombinant vector. Marker proteins of choice include, e.g., Green 
Fluorescent Protein (GFP; Clontech), CD64, or a CD64-GFP fusion protein. Flow cytometry (FCM), 

25 an automated, laser optics-based technique, is used to identify transfected cells expressing GFP or 
CD64-GFP and to evaluate the apoptotic state of the cells and other cellular properties. FCM detects 
and quantifies the uptake of fluorescent molecules that diagnose events preceding or coincident with 
cell death. These events include changes in nuclear DNA content as measured by staining of DNA 
with propidium iodide; changes in cell size and granularity as measured by forward light scatter and 

30 90 degree side light scatter; down-regulation of DNA synthesis as measured by decrease in 

bromodeoxyuridine uptake; alterations in expression of cell surface and intracellular proteins as 
measured by reactivity with specific antibodies; and alterations in plasma membrane composition as 
measured by the binding of fluorescein-conjugated Annexin V protein to the cell surface. Methods in 
flow cytometry are discussed in Ormerod, M.G. (1994) Flow Cytometry . Oxford, New York NY. 

35 The influence of TXREG on gene expression can be assessed using highly purified 
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populations of cells transfected with sequences encoding TXREG and either CD64 or CD64-GFP. 
CD64 and CD64-GFP are expressed on the surface of transfected cells and bind to conserved regions 
of human immunoglobulin G (IgG). Transfected cells are efficiently separated from nontransfected 
cells using magnetic beads coated with either human IgG or antibody against CD64 (DYNAL, Lake 
5 Success NY). mRNA can be purified from the cells using methods well known by those of skill in 
the art. Expression of mRNA encoding TXREG and other genes of interest can be analyzed by 
northern analysis or microarray techniques. 

XIII. Production of TXREG Specific Antibodies 

TXREG substantially purified using pol y aery 1 amide gel electrophoresis (PAGE; see, e.g., 
10 Harrington, M.G. (1990) Methods Enzymol. 182:488-495), or other purification techniques, is used to 
immunize rabbits and to produce antibodies using standard protocols. 

Alternatively, the TXREG amino acid sequence is analyzed using LASER GENE software 
(DNASTAR) to determine regions of high immunogenicity, and a corresponding oligopeptide is 
synthesized and used to raise antibodies by means known to those of skill in the art. Methods for 
15 selection of appropriate epitopes, such as those near the C-terminus or in hydrophilic regions are well 
described in the art. (See, e.g., Ausubel, 1995, supra , ch. 11.) 

Typically, oligopeptides of about 15 residues in length are synthesized using an ABI 431 A 
peptide synthesizer (PE Biosystems) using FMOC chemistry and coupled to KLH (Sigma-Aldrich, St. 
Louis MO) by reaction with N-maleimidobenzoyl-N-hydroxysuccinimide ester (MBS) to increase 
20 immunogenicity. (See, e.g., Ausubel, 1995, supra .) Rabbits are immunized with the oligopeptide- 
KLH complex in complete Freund's adjuvant. Resulting antisera are tested for antipeptide and anti- 
TXREG activity by, for example, binding the peptide or TXREG to a substrate, blocking with 1% 
BSA, reacting with rabbit antisera, washing, and reacting with radio-iodinated goat anti-rabbit IgG. 

XIV. Purification of Naturally Occurring TXREG Using Specific Antibodies 

25 Naturally occurring or recombinant TXREG is substantially purified by immunoaffinity 

chromatography using antibodies specific for TXREG. An immunoaffinity column is constructed by 
covalently coupling anti-TXREG antibody to an activated chromatographic resin, such as 
CNBr-activated SEPHAROSE (Amersham Pharmacia Biotech). After the coupling, the resin is 
blocked and washed according to the manufacturer's instructions. 

30 Media containing TXREG are passed over the immunoaffinity column, and the column is 

washed under conditions that allow the preferential absorbance of TXREG (e.g., high ionic strength 
buffers in the presence of detergent). The column is eluted under conditions that disrupt 
antibody/TXREG binding (e.g., a buffer of pH 2 to pH 3, or a high concentration of a chaotrope, such 
as urea or thiocyanate ion), and TXREG is collected. 

35 XV. Identification of Molecules Which Interact with TXREG 
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TXREG, or biologically active fragments thereof, are labeled with I25 I Bolton-Hunter reagent. 
(See, e.g., Bolton AJE. and W.M. Hunter (1973) Biochem. J. 133:529-539.) Candidate molecules 
previously arrayed in the wells of a multi-well plate are incubated with the labeled TXREG, washed, 
and any wells with labeled TXREG complex are assayed. Data obtained using different 
5 concentrations of TXREG are used to calculate values for the number, affinity, and association of 
TXREG with the candidate molecules. 

Alternatively, molecules interacting with TXREG are analyzed using the yeast two-hybrid 
system as described in Fields, S. and O. Song (1989, Nature 340:245-246), or using commercially 
available kits based on the two-hybrid system, such as the MATCHMAKER system (Clontech). 
10 TXREG may also be used in the PATHCALLING process (CuraGen Corp., New Haven CT) 

which employs the yeast two-hybrid system in a high-throughput manner to determine all interactions 
between the proteins encoded by two large libraries of genes (Nandabalan, K. et al. (2000) U.S. 
Patent No. 6,057,101). 

15 Various modifications and variations of the described methods and systems of the invention 

will be apparent to those skilled in the art without departing from the scope and spirit of the 
invention. Although the invention has been described in connection with certain embodiments, it 
should be understood that the invention as claimed should not be unduly limited to such specific 
embodiments. Indeed, various modifications of the described modes for carrying out the invention 

20 which are obvious to those skilled in molecular biology or related fields are intended to be within the 
scope of the following claims. 
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Vector 


pINCY 


pINCY 


pINCY 


PSPORTl 


PSPORTl 


PBLUESCRIPT 


PBLUESCRIPT 


PSPORTl 


Disease or Condition 
(Fraction of Total) 


Cell proliferation (0.651) 
Inflammation (0.303) 


Cell proliferation (0.407) 
Inflammation (0.481) 


Cell proliferation (0.333) 
Inf lanunation (0 . 667) 


Cell proliferation (0.691) 
Inflammation (0.289) 


Cell proliferation (0.793) 
Inflammation (0.241) 


Cell proliferation (0.565) 
Inflammation (0.348) 


Cell proliferation (0.520) 
Inflammation (0.380) 


Cell proliferation (0.600) 
Inflammation (0.200) 


Tissue Expression 
(Fraction of Total) 


Nervous (0.248) 
Repr oduc t i ve (0.239) 
Gastrointestinal (0 . 110) 


Nervous (0.222) 
Repr oduc t i ve (0.148) 
Cardiovascular (0 . 148) 
Gastrointestinal (0". 148) 


Reproductive (0 . 6 67) 
Hematopoietic/Immune (0.333) 


Reproductive (0 .270) 
Nervous (0.184) 
Cardiovascular (0 . 132) 


Reproductive (0 . 310) 
Nervous (0.172) 
Gastrointestinal (0.172) 


Hematopoietic/Immune (0.348) 
Nervous (0.174) 
Cardiovascular (0.087) 
Gastrointestinal (0.087) 
Musculoskeletal (0.087) 
Reproductive (0.087) 


Cardiovascular (0.200) 
Reproductive (0 . 200) 
Nervous (0.140) 


Cardiovascular (0 .200) 
Endocrine (0.200) 
Hematopoietic/Immune (0 . 200 ) 
Nervous (0.200) 
Reproductive (0 .200) 


Selected Fragments 
of Nucleic Acid 
Sequence 


227-271 
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460-504 


664-708 
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What is claimed is: 

1 . An isolated polypeptide comprising an amino acid sequence selected from the group 
consisting of: 

5 a) an amino acid sequence selected from the group consisting of SEQ ID NO: 1-32, 

b) a naturally occurring amino acid sequence having at least 90% sequence identity to an 
amino acid sequence selected from the group consisting of SEQ ID NO: 1-32, 

c) a biologically active fragment of an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1 -32, and 

10 d) an immunogenic fragment of an amino acid sequence selected from the group consisting 

of SEQ ID NO: 1-32. 

2. An isolated polypeptide of claim 1 selected from the group consisting of SEQ ID NO:l- 

32. 

15 

3. An isolated polynucleotide encoding a polypeptide of claim 1. 

4. An isolated polynucleotide encoding a polypeptide of claim 2. 

20 5. An isolated polynucleotide of claim 4 selected from the group consisting of SEQ ID 

NO:33-64. 

6. A recombinant polynucleotide comprising a promoter sequence operably linked to a 
polynucleotide of claim 3. 

25 

7. A cell transformed with a recombinant polynucleotide of claim 6. 

8. A transgenic organism comprising a recombinant polynucleotide of claim 6. 

30 9. A method for producing a polypeptide of claim 1, the method comprising: 

a) culturing a cell under conditions suitable for expression of the polypeptide, wherein said 
cell is transformed with a recombinant polynucleotide, and said recombinant polynucleotide 
comprises a promoter sequence operably linked to a polynucleotide encoding the polypeptide of 
claim 1, and 

35 b) recovering the polypeptide so expressed. 
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10. An isolated antibody which specifically binds to a polypeptide of claim 1. 

1 1 . An isolated polynucleotide comprising a polynucleotide sequence selected from the 
group consisting of: 

5 a) a polynucleotide sequence selected from the group consisting of SEQ ID NO:33-64, 

b) a naturally occurring polynucleotide sequence having at least 70% sequence identity to a 
polynucleotide sequence selected from the group consisting of SEQ ID NO:33-64, 

c) a polynucleotide sequence complementary to a), 

d) a polynucleotide sequence complementary to b), and 
10 e) an RNA equivalent of a)-d). 

12. An isolated polynucleotide comprising at least 60 contiguous nucleotides of a 
polynucleotide of claim 1 1 . 

15 13. A method for detecting a target polynucleotide in a sample, said target polynucleotide 

having a sequence of a polynucleotide of claim 11, the method comprising: 

a) hybridizing the sample with a probe comprising at least 20 contiguous nucleotides 
comprising a sequence complementary to said target polynucleotide in the sample, and which probe 
specifically hybridizes to said target polynucleotide, under conditions whereby a hybridization 

20 complex is formed between said probe and said target polynucleotide or fragments thereof, and 

b) detecting the presence or absence of said hybridization complex, and, optionally, if 
present, the amount thereof. 

14. A method of claim 13, wherein the probe comprises at least 60 contiguous nucleotides. 

25 - 

15. A method for detecting a target polynucleotide in a sample, said target polynucleotide 
having a sequence of a polynucleotide of claim 11, the method comprising: 

a) amplifying said target polynucleotide or fragment thereof using polymerase chain reaction 
amplification, and 

30 b) detecting the presence or absence of said amplified target polynucleotide or fragment 

thereof, and, optionally, if present, the amount thereof. 

16. A pharmaceutical composition comprising an effective amount of a polypeptide of claim 
1 and a pharmaceutically acceptable excipient. 

35 
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17. A pharmaceutical composition of claim 16, wherein the polypeptide comprises an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-32. 

18. A method for treating a disease or condition associated with decreased expression of 
5 functional TXREG, comprising administering to a patient in need of such treatment the 

pharmaceutical composition of claim 16. 

19. A method for screening a compound for effectiveness as an agonist of a polypeptide of 
claim 1 , the method comprising: 

10 a) exposing a sample comprising a polypeptide of claim 1 to a compound, and 

b) detecting agonist activity in the sample. 

20. A pharmaceutical composition comprising an agonist compound identified by a method 
of claim 19 and a pharmaceutically acceptable excipient. 

15 

21. A method for treating a disease or condition associated with decreased expression of 
functional TXREG, comprising administering to a patient in need of such treatment a pharmaceutical 
composition of claim 20. 

20 22. A method for screening a compound for effectiveness as an antagonist of a polypeptide 

of claim 1, the method comprising: 

a) exposing a sample comprising a polypeptide of claim 1 to a compound, and 

b) detecting antagonist activity in the sample. 

25 23. A pharmaceutical composition comprising an antagonist compound identified by a 

method of claim 22 and a pharmaceutically acceptable excipient. 

24. A method for treating a disease or condition associated with overexpression of functional 
TXREG, comprising administering to a patient in need of such treatment a pharmaceutical 

30 composition of claim 23. 

25. A method of screening for a compound that specifically binds to the polypeptide of claim 
1 , said method comprising the steps of: 

a) combining the polypeptide of claim 1 with at least one test compound under suitable 
35 conditions, and 
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b) detecting binding of the polypeptide of claim 1 to the test compound, thereby identifying 
a compound that specifically binds to the polypeptide of claim 1. 

26. A method of screening for a compound that modulates the activity of the. polypeptide of 
claim 1, said method comprising: 

a) combining the polypeptide of claim 1 with at least one test compound under conditions 
permissive for the activity of the polypeptide of claim 1, 

b) assessing the activity of the polypeptide of claim 1 in the presence of the test compound, 

and 

c) comparing the activity of the polypeptide of claim 1 in the presence of the test compound 
with the activity of the polypeptide of claim 1 in the absence of the test compound, wherein a change 
in the activity of the polypeptide of claim 1 in the presence of the test compound is indicative of a 
compound that modulates the activity of the polypeptide of claim 1. 

27. A method for screening a compound for effectiveness in altering expression of a target 
polynucleotide, wherein said target polynucleotide comprises a sequence of claim 5, the method 
comprising: 

a) exposing a sample comprising the target polynucleotide to a compound, and 

b) detecting altered expression of the target polynucleotide. 
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Thr 


Leu 
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Glu 


Ala 


Pro 


Gin 


Val 


Met 


Ser 


Leu 


His 


Val 
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Ala 


Gin 


Asp 


Leu 
45 
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He 


Arg 
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Glu 


Glu 
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Glu 
55 


Glu 


Lys 
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Ser 
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60 
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Ser 


Glu 


Ser 


Ser 
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Ser 


Ser 


Lys 


Pro 


Asn 
70 


Gin 


Lys 


Lys 


Leu 


He 
75 


Val 


Leu 


Ser 
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Ser 
80 
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Val 
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85 


Ser 


Asp 
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Ser 


Glu 
90 


Val 


He 


Thr 


Leu 


Ser 
95 


Asp 


Glu 


Asp 


Ser 


He 
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Tyr 


Arg 
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Lys 


Gly 
105 


Lys 


Asn 


Val 


Arg 


Val 
110 


Gin 


Ala 


Gin 


Glu 


Asn 
115 


Ala 
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Leu 


Ser 
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Ser 


Ser 


Leu 
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Ser 
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Asn 


Glu 
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Val 


Asp 
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Lys 


Lys 
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Ser 
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Asp 


He 


Glu 


Lys 


Pro 
140 
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Ser 


Glu 


Glu 


Arg 
145 


Ser 
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Val 


He 


Arg 
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Val 


Met 
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He 
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Glu 


Val 


Ser 


Ser 


Ser 
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Glu 


Glu 


Glu 


Ser 
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Thr- 


He 


Ser 


Glu 


Gly 
170 


Asp 
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Val 


Glu 


Ser 
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Trp 


Met 


Leu 
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Val 


Asp 


Asp 
185 
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Asp 


Asp 


Asp 


He 
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Val 
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230 








He 
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Pro 
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Trp 


Asp 


Lys 


Gin 
305 
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Pro 
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Pro 


Pro 
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Lys 


Leu 


Glu 


Asn 


Pro 


His 


His 
425 


Asp 


He 


Arg 


Ser 


Asn 


Arg 


Trp 


Pro 
440 


Gin 


Glu 


Asn 


Lys 


Asn 


Lys 


Asn 


Arg 
455 


Asn 


Trp 


Glu 


His 


Arg 


Glu 


Val 


Asp 
470 


Glu 


Asp 


Phe 


Ser 


Ser 


Pro 


Gly 


Ser 
485 


Phe 


Lys 


Thr 


His 


Arg 


Ser 


Ser 


His 
500 


Tyr 


His 


Thr 


Lys 


Glu 


Gly 


Lys 


Arg 
515 


Gly 


Lys 


Gin 


Glu 


Asp 


Asp 


Asp 


Asn 
530 


Asp 


Asn 


Leu 


Lys 


Lys 


Ser 
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Met 


Tyr 


Ser 


Thr 


Arg 


Lys 


Asn 


Cys 


1 






5 








Ala 


Ala 


Phe 


He 


Asn 
20 


His 


Asp 


Cys 


Ser 


Thr 


Gly 


Arg 


Asp 
. 35 


Thr 


Ala 


Cys 


Glu 


Pro 


Gly Glu 


Glu 


He 


Ser 


Cys 










50 








Gly 


Glu 


Asn 


Asn 


Glu 


Phe 


Cys 


Glu 








65 








Gly 


Thr 


Gly 


Ala 


Phe 


Lys 


Ser 


Arg 








80 








Pro 


Val 


He 


Asn 


Ser 
95 


Lys 


Tyr 


Gly 





235 










240 


Arg 


Gly 
250 


His 


Leu 


Ser 


Lys 


Asn 
255 


Arg 


Cys 
265 


Phe 


Leu 


Cys 


Ser 


Arg 
270 


Pro 


Ala 
280 


Pro 


Leu 


Cys 


Glu 


Tyr 
285 


His 


Ser 


Cys 


Leu 


Phe 


Arg 


His 




295 








300 


Cys 


His 
310 


Met 


Leu 


Gly 


His 


Tyr 
315 


Arg 


Gin 


Tyr 


His 


Leu 


Thr 


Thr 


325 










330 


Lys 


Thr 
340 


Pro 


Ser 


Arg 


Pro 


Ser 
345 


Ala 


Gin 
355 


Lys 


Gly 


His 


Tyr 


Gly 
360 


Tyr 


Asp 
370 


Pro 


Ser 


Pro 


Val 


Ser 
375 


Lys 


Tyr 
385 


Glu 


He 


Gin 


Glu 


Arg 
390 


Lys 


Val 
400 


Leu 


Lys 


Lys 


Asn 


Gly 
405 


Pro 


Tyr 
415 


He 


Lys 


Ala 


Ala 


Asn 
420 


Lys 


Gly 
430 


Arg 


Ala 


Ser 


Trp 


Lys 
435 


Lys 


Glu 
445 


Thr 


Gin 


Lys 


Glu 


Met 
450 


Lys 


His 
460 


Arg 


Lys 


Ala 


Asp 


Arg 
465 


Pro 


Arg 
475 


Gly 


Pro 


Lys 


Thr 


Tyr 
480 


Gin 


Lys 
490 


Pro 


Ser 


Lys 


Pro 


Phe 
495 


Ser 


Arg 
505 


Glu 


Asp 


Lys 


Ser 


Pro 
510 


Lys 


Lys 
520 


Lys 


Glu 


Arg 


Cys 


Trp 
525 


Phe 


Leu 
535 


He 


Lys 


Gin 


Arg 


Lys 
540 



Ala 


Gin 


Leu 


Trp 


Leu 


Gly 


Pro 




10 










15 


Arg 


Pro 


Asn 


Cys 


Lys 


Phe 


Val 




25 










30 


Val 


Lys 


Ala 


Leu 


Arg 


Asp 


He 




40 










45 


Tyr 


Tyr 


Gly 


Asp 


Gly* 


Phe 


Phe 




55 










60 


Cys 


Tyr 


Thr 


Cys 


Glu 


Arg 


Arg 




70 










75 


Val 


Gly 


Leu 


Pro 


Ala 


Pro 


Ala 




85 










90 


Leu 


Arg 


Glu 


Thr 


Asp 


Lys 


Arg 




100 










105 
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Leu 


Asn 


Arg 


Leu 


Lys 
110 


Lys 


Leu 


Gly 


Asp 


Ser 
115 


Ser 


Lys 


Asn 


Ser 


Asp 
120 


Ser 


Gin 


Ser 


Val 


Ser 
125 


Ser 


Asn 


Thr 


Asp 


Ala 
130 


Asp 


Thr 


Thr 


Gin 


Glu 
135 


Lys 


Asn 


Asn 


Ala 


Thr 
140 


Ser 


Asn 


Arg 


Lys 


Ser 
145 


Ser 


Val 


Gly 


Val 


Lys 
150 


Lys 


Asn 


Ser 


Lys 


Ser 
155 


Arg 


Thr 


Leu 


Thr 


Arg 
160 


Gin 


Ser 


Met 


Ser 


Arg 
165 


He 


Pro 


Ala 


Ser 


Ser 
170 


Asn 


Ser 


Thr 


Ser 


Ser 
175 


Lys 


Leu 


Thr 


His 


He 

180 


Asn 


Asn 


Ser 


Arg 


Val 
185 


Pro 


Lys 


Lys 


Leu 


Lys 
190 


Lys 


Pro 


Ala 


Lys 


Pro 
195 


Leu 


Leu 


Ser 


Lys 


He 
200 


Lys 


Leu 


Arg 


Asn 


His 
205 


Cys 


Lys 


Arg 


Leu 


Glu 
210 


Gin 


Lys 


Asn 


Ala 


Ser 
215 


Arg 


Lys 


Leu 


Glu 


Met 
220 


Gly 


Asn 


Leu 


Val 


Leu 
225 


Lys 


Glu 


Pro 


Lys 


Val 
230 


Val 


Leu 


Tyr 


Lys 


Asn 
235 


Leu 


Pro 


He 


Lys 


Lys 
240 


Asp 


Lys 


Glu 


Pro 


Glu 
245 


Gly 


Pro 


Ala 


Gin 


Ala 
250 


Ala 


Val 


Ala 


Ser 


Gly 
255 


Cys 


Leu 


Thr 


Arg 


His 
260 


Ala 


Ala 


Arg 


Glu 


His 
265 


Arg 


Gin 


Asn 


Pro 


Val 
270 


Arg 


Gly 


Ala 


His 


Ser 
275 


Gin 


Gly 


Glu 


Ser 


Ser 
280 


Pro 


Cys 


Thr 


Tyr 


He 
285 


Thr 


Arg 


Arg 


Ser 


Val 
290 


Arg 


Thr 


Arg 


Thr 


Asn 
295 


Leu 


Lys 


Glu 


Ala 


Ser 
300 


Asp 


He 


Lys 


Leu 


Glu 
305 


Pro 


Asn 


Thr 


Leu 


Asn 
310 


Gly 


Tyr 


Lys 


Ser 


Ser 
315 


Val 


Thr 


Glu 


Pro 


Cys 


Pro 


Asp 


Ser 


Gly Glu 


Gin 


Leu 


Gin 


Pro 


Ala 










320 










325 










330 


Pro 


Val 


Leu 


Gin 


Glu 


Glu 


Glu 


Leu 


Ala 


His 


Glu 


Thr 


Ala 


Gin 


Lys 










335 










340 










345 


Gly 


Glu 


Ala 


Lys 


Cys 
350 


His 


Lys 


Ser 


Asp 


Thr 
355 


Gly 


Met 


Ser 


Lys 


Lys 
360 


Lys 


Ser 


Arg 


Gin 


Gly 
365 


Lys 


Leu 


Val 


Lys 


Gin 
370 


Phe 


Ala 


Lys 


He 


Glu 
375 


Glu 


Ser 


Thr 


Pro 


Val 
380 


His 


Asp 


Ser 


Pro 


Gly 
385 


Lys 


Asp 


Asp 


Ala 


Val 
390 


Pro 


Asp 


Leu 


Met 


Gly 
395 


Pro 


His 


Ser 


Asp 


Gin 
400 


Gly 


Glu 


His 


Ser 


Gly 
405 


Thr 


Val 


Gly 


Val 


Pro 
410 


Val 


Ser 


Tyr 


Thr 


Asp 
415 


Cys 


Ala 


Pro 


Ser 


Pro 
420 


Val 


Gly 


Cys 


Ser 


Val 
425 


Val 


Thr 


Ser 


Asp 


Ser 
430 


Phe 


Lys 


Thr 


Lys 


Asp 
435 


Ser 


Phe 


Arg 


Thr 


Ala 
440 


Lys 


Ser 


Lys 


Lys 


Lys 
445 


Arg 


Arg 


He 


Thr 


Arg 
450 


Tyr 


Asp 


Ala 


Gin 


Leu 
455 


He 


Leu 


Glu 


Asn 


Asn 
460 


Ser 


Gly 


He 


Pro 


Lys 
465 


Leu 


Thr 


Leu 


Arg 


Arg 
470 


Arg 


His 


Asp 


Ser 


Ser 
475 


Ser 


Lys 


Thr 


Asn 


Asp 
480 


Gin 


Glu 


Asn 


Asp 


Gly 
485 


Met 


Asn 


Ser 


Ser 


Lys 
490 


He 


Ser 


He 


Lys 


Leu 
495 


Ser 


Lys 


Asp 


His 


Asp 
500 


Asn 


Asp 


Asn 


Asn 


Leu 
505 


Tyr 


Val 


Ala 


Lys 


Leu 
510 


Asn 


Asn 
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Phe 


Asn 
515 
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Gly 


Ser 


Gly 


Ser 
520 


Ser 


Ser 


Thr 


Lys 


Leu 
525 


Lys 


He 


Gin 


Leu 


Lys 
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Arg 


Asp 


Glu 


Glu 


Asn 
535 


Arg 
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Ser 


Tyr 


Thr 
540 


Glu 


Gly 


Leu 


His 


Glu 
545 


Asn 


Gly 


Val 


Cys 


Cys 
550 


Ser 


Asp 


Pro 


Leu 


Ser 
555 


Leu 


Leu 


Glu 


Ser 


Arg 
560 


Met 


Glu 


Val 


Asp 


Asp 
565 


Tyr 


Ser 


Gin 


Tyr 


Glu 
570 


Glu 


Glu 


Ser 


Thr 


Asp 
575 


Asp 


Ser 


Ser 


Ser 


Ser 
580 


Glu 


Gly 


Asp 


Glu 


Glu 
585 


Glu 


Asp 


Asp 


Tyr 


Asp 
590 


Asp 


Asp 


Phe 


Glu 


Asp 
595 


Asp 


Phe 


He 


Pro 


Leu 
600 


Pro 


Pro 


Ala 


Lys 


Arg 


Leu 


Arg 


Leu 


He 


Val 


Gly 


Lys 


Asp 


Ser 


He 
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605 610 615 

Asp lie Asp lie Ser Ser Arg Arg Arg Glu Asp Gin Ser Leu Arg 
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Met 


Ser 


Ala 


Phe 


Ser 


Glu 


Ala 


Ala 


Leu 


Glu 


Lys 


Lvs 


Leu 


Ser 


Glu 
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10 










15 


Leu 


Ser 


Asn 


Ser 


Gin 


Gin 


Ser 


Val 


Gin 


Thr 


Leu 


Ser 


Leu 


Tro 


Leu 










20 










25 










30 


lie 
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His 


Arg 


Lys 


His 


Ser 


Arg 


Pro 


lie 


Val 


Thr 


Val 


Tro 


Glu 










35 










40 










45 


AT"Cf 

y 


Glu 


Leu 




Lys 


Ala 


Lys 


Pro 


Asn 


Arg 


Lys 


Leu 


Thr 


Phe 


Leu 
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55 










60 


r P\/'T" 

x y x 


Leu 


Ala 


Asn 


Asp 


Val 


lie 


Gin 


Asn 


Ser 


Lys 


Arg 


Lys 


Gly 


Pro 










65 










70 










75 


Glu 


Phe 


Thr 


Lys 


Asp 


Phe 


Ala 


Pro 


Val 


He 


Val 


Glu 


Ala 


Phe 


Lys 








80 










85 










90 




Val 


Ser 


Ser 


Glu 


Thr 


Asp 


Glu 


Ser 


v,y t> 


Lys 


Lys 


His 


Leu 


Gly 










95 










100 










105 


Arg 


V d -L 


Leu 


OCX 


Tip 
J- J- 


lip 


Glu 


Glu 


Arg 


Ser 


Val 


i y x 


Glu 


Asn 


Asp 










110 










115 










120 


v a i 


i_>eu 


ri~] , 

Ul XJ. 


Gin 


Leu 


Lys 


Gin 


Ala 


Leu 


x y i 


Gly 


Asp 


Lys 


Lys 


Pro 










12 5 










130 










135 


- 

Arg 


i-<y s 


Arg 


Thr 


j. y j. 


Glu 


Gin 


He 


Lys 


Val 


Asp 


Glu 


Asn 


Glu 


Asn 










14 0 










145 










150 


v_y 


Ser 


Ser 


Leu 


Gly 


Ser 


Pro 


Ser 


Glu 


Pro 


Pro 


Gin 


Thr 


Leu 


Asp 








155 










160 










165 


Leu 


Val 


Arg 


Ala 


Leu 


Gin 


Asp 


Leu 


Glu 


Asn 


Ala 


Ala 


Ser 


Gly 


Asp 










170 










175 










180 


Ala 


Ala 


Val 


His 


Gin 


Arg 


He 


Ala 


Ser 


Leu 


Pro 


Val 


Glu 


Val 


Gin 










185 










190 










195 


Glu 


Val 


Ser 


Leu 


Leu 


Asp 


Lys 


He 


Thr 


Asp 


Lys 


Glu 


Ser 


Gly Glu 










200 










205 










210 


Arg 


Leu 


Ser 


Lys 


Met 


Val 


Glu 


Asp 


Ala 


Cys 


Met 


Leu 


Leu 


Ala 


Asp 








215 










220 










225 


Tyr 


Asn 


Gly 


Arg 


Leu 


Ala 


Ala 


Glu 


He 


Asp 


Asp 


Arg 


Lys 


Gin 


Leu 










230 










235 










240 


Thr 


Arg 


Met 


Leu 


Ala 


Asp 


Phe 


Leu 


Arg 


Cys 


Gin 


Lys 


Glu 


Ala 


Leu 










245 










250 










255 


Ala 


Glu 


Lys 


Glu 


His 


Lys 


Leu 


Glu 


Glu 


Tyr 


Lys 


Arg 


Lys 


Leu 


Ala 








260 










265 










270 


Arg 


Val 


Ser 


Leu 


Val 


Arg 


Lys 


Glu 


Leu 


Arg 


Ser 


Arg 


He 


Gin 


Ser 








275 










280 










285 


Leu 


Pro 


Asp 


Leu 


Ser 


Arg 


Leu 


Pro 


Asn 


Val 


Thr 


Gly 


Ser 


His 


Met 










290 










295 










300 


His 


Leu 


Pro 


Phe 


Ala 


Gly Asp 


He 


Tyr 


Ser 


Glu 


Asp 
















305 










310 
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<400> 9 



Met 


Phe 


Asn 


Gly 


Gly 


Met 


Ala 


• 1 








5 






Pro 


Asp 


Val 


Glu 


Pro 
20 


Ala 


Ala 


Tyr 


Ser 


Asp 


Glu 


Val 
35 


Gin 


He 


Leu 


Tyr 


Thr 


Ala 


Lys 
50 


Lys 


Tyr 


Cys 


Val 


Glu 


Phe 


Leu 
65 


Lys 


Lys 


Met 


Leu 


Leu 


Thr 


Gin 
80 


Ala 


Arg 


Ser 


Leu 


Cys 


Leu 


Glu 
95 


Asn 


He 


Thr 


Ala 


Glu 


Gly 


Phe 
110 


Thr 


Asp 


Val 


Leu 


Glu 


Arg 


Asp 
125 


Thr 


Leu 


Asn 


Ala 


Val 


Val 


Arg 
140 


Trp 


Ser 


Leu 


Gin 


Val 


Thr 


Pro 
155 


Glu 


Asn 


Leu 


Gly 

w Jr 


Leu 


He 


Aro 
170 


Phe 


Pro 


Ala 


Gly 


Pro 


Ala 


Gin 
185 


Ser 


Gly 


Ser 


Leu 


Phe 


Leu 


His 
200 


Phe 


Thr 


Phe 


He 


Asp 


Aro 


Pro 
215 


Arcr 


Cys 


He 


Asn 


Arcr 


Phe 


Gin 
230 


Gin 


Val 


Thr 


Ser 


Asp 


Arcr 


He 
245 


Arcr 


Phe 


Val 


Gly 


Phe 


Gly 


Leu 
260 


Tyr 


Gly 


Gin 


Val 


Asn 


He 


Gin 
275 


He 


He 


Gly 


Gin 


Asn 


Asp 


Thr 
290 


Gly 

w - 1 - Jr 


Phe 


Phe 


Arg 


Val 


Met 


Phe 
305 


Lys 


Glu 


Asn 


Tyr 


Thr 


Ala 


Cys 
320 


Ala 


Thr 


Gly 


Thr 


Lys 


Gly 


Leu 
335 


Arg 


Lys 


Gly 


Ala 


Lys 


Thr 


Cys 
350 


Phe 


Thr 


Asn 


Gly 


Thr 


Ser 


Val 


Glu 


Asp 



365 

Tyr Thr 



Thr 


Thr 


Ser 


Thr 


Glu 


He 


Glu 


Leu 






10 










15 


Phe 


Leu 


Ala 


Leu 


Leu 


Lys 


Phe 


Leu 






25 










30 


Gly 


Pro 


Glu 


Thr 


Val 


Met 


Thr 


Thr 






40 










45 


Ala 


Val 


Pro 


Ala 


Leu 


Glu 


Ala 


His 






55 










60 


Asn 


Leu 


Arg 


Ala 


Asp 


Asn 


Ala 


Phe 






70 










75 


Leu 


Phe 


Asp 


Glu 


Pro 


Gin 


Leu 


Ala 






85 










90 


Asp 


Lys 


Asn 


Thr 


Ala 


Asp 


Ala 


He 






100 










105 


He 


Asp 


Leu 


Asp 


Thr 


Leu 


Val 


Ala 






115 










120 


Gly 


He 


Arg 


Glu 


Val 


Arg 


Leu 


Phe 






130 










135 


Glu 


Ala 


Glu 


Cys 


Gin 


Ara 


Gin 


Gin 






145 










150 


Arg 


Arg 


Lys 


Val 


Leu 


Gly 


Lys 


Ala 






160 










165 


Leu 


Met 


Thr 


He 


Glu 


Glu 


Phe 


Ala 






175 










180 


He 


Leu 


Val 


ASD 


Arq 


Glu 


Val 


Val 






190 










195 


Val 


Asn 


Pro 


Lys 


Pro 


Ara 


Val 


Glu 






205 










210 


Cys 


Leu 


Arcr 


Gly 

w -*- Jr 


Lys 


Glu 


Cvs 


Ser 






220 










225 


Glu 


Ser 


Aro 


Tro 


Gly 


Tyr 


Ser 


Gly 






235 










240 


Ser 


Val 


Asn 


Lys 


Ara 


He 


Phe 


Val 






250 










255 


Ser 


He 


His 


Gly 


Pro 


Thr 


Asp 


Tyr 

Jr 






265 










270 


His 


Thr 


Asp 


Ser 


Asn 


Thr 


Val 


Leu 






280 










285 


Ser 


Cys 


Asp Gly 


Ser 


Ala 


Ser 


Thr 






295 










300 


Pro 


Val 


Glu 


Val 


Leu 


Pro 


Asn 


Val 






310 










315 


Leu 


Lys 


Gly 


Pro 


Asp 


Ser 


His 


Tvr 






325 










330 


Val 


Thr 


His 


Glu 


Ser 


Pro 


Thr 


Thr 






340 










345 


Phe 


Cys 


Tyr 


Ala 


Ala 


Gly 


Asn 


Asn 






355 










360 


Gly 


Gin 


He 


Pro 


Glu 


Val 


He 


Phe 






370 










375 
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Leu 


Gly 


Ala 


Pro 


Lvs 


Cvs 


Thr 


Arg 


Cys 


Leu 


lie 


Thr 


Phe 


Ala 


Asp 










35 










40 










45 


Ser 


Lvs 


Phe 


Gin 


Glu 


Arq 


His 


Met 


Lys 


Ara 


Glu 


His 


Pro 


Ala 


Asp 










50 










55 










60 


Phe 


Val 


Ala 


Gin 


Lys 


Leu 


Gin 


Gly 


Val 


Leu 


Phe 


He 


Cys 


Phe 


Thr 










65 








70 










75 


Cvs 


Ala 


Arq 


Ser 


Phe 


Pro 


Ser 


Ser 


LVS 


Ala 


Leu 


He 


Thr 


His 


Gin 








80 










85 










90 


Arc? 


Ser 


His 


Gly 


Pro 


Ala 


Ala 


LVS 


Pro 


Thr 


Leu 


Pro 


Val 


Ala 


Thr 








95 










100 










105 


Thr 


Thr 


Ala 


Gin 


Pro 


Thr 


Phe 


Pro 


Cvs 


Pro 


Asp 


Cys 


Gly 


Lys 


Thr 










110 










115 










120 


Phe 


Gly 


Gin 


Ala 


Val 


Ser 


Leu 


Arg 


Arg 


His 


Arg 


Gin 


Met 


His 


Glu 










125 










130 










135 


Val 


Arg 


Ala 


Pro 


Pro 


Gly 


Thr 


Phe 


Ala 


Cys 


Thr 


Glu 


Cys 


Gly 


Gin 








140 










145 










150 


Asp 


Phe 


Ala 


Gin 


Glu 


Ala 


Gly 


Leu 


His 


Gin 


His 


Tyr 


He 


Arg 


His 








155 










160 










165 


Ala 


Arg 


Gly Glu 


Leu 























170 
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Met 


Ala 


Arg 


Thr 


Lys 


Gin 


Thr 


Ala 


Arg 


Lys 


Ser 


Thr 


Gly 


Gly 


Lys 


1 






5 










10 










15 


Ala 


Pro 


Arg 


Lys 


Gin 
20 


Leu 


Ala 


Thr 


Lys 


Ala 
25 


Ala 


Arg 


Lys 


Ser 


Ala 
30 


Pro 


Ala 


Thr 


Gly 


Gly 
35 


Val 


Lys 


Lys 


Pro 


His 
40 


Arg 


Tyr 


Arg 


Pro 


Gly 
45 


Thr 


Val 


Ala 


Leu 


Arg 
50 


Glu 


He 


Arg 


Arg 


Tyr 
55 


Gin 


Lys 


Ser 


Thr 


Glu 
60 


Leu 


Leu 


He 


Arg 


Lys 
65 


Leu 


Pro 


Phe 


Gin 


Arg 
70 


Leu 


Val 


Arg 


Glu 


He 
75 


Ala 


Gin 


Asp 


Phe 


Lys 
80 


Thr 


Asp 


Leu 


Arg 


Phe 
85 


Gin 


Ser 


Ser 


Ala 


Val 
90 


Met 


Ala 


Leu 


Gin 


Glu 
95 


Ala 


Cys 


Glu 


Ala 


Tyr 
100 


Leu 


Val 


Gly 


Leu 


Phe 
105 


Glu 


Asp 


Thr 


Asn 


Leu 


Cys 


Gly 


He 


Gin 


Arg 


Gin 


Ala 


Arg 


His 


Tyr 








110 










115 










120 


His 


Ala 


Gin 


Gly 


His 


Pro 


Thr 


His 


Pro 


Pro 


Ala 


Ser 


Ala 


Glu 


Glu 








125 










130 










135 


Arg 


Ala 


Val 


He 


Thr 


Val 


Gly 


Leu 


Ser 


Cys 


Arg 


Ser 


Lys 


Gin 


Arg 








140 










145 










150 


Val 


Phe 


Phe 


Arg 


Ala 
155 


Thr 


Thr 


Phe 


Ser 


Lys 
160- 













<210> 12 

<211> 219 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 1969301CD1 

<400> 12 

Met Asn Arg Leu Phe Gly Lys Ala Lys Pro Lys Ala Pro Pro Pro 
15 10 15 

Ser Leu Thr Asp Cys He Gly Thr Val Asp Ser Arg Ala Glu Ser 



10/50 



BNSDOCID: <WO 0078954A2_L> 



WO 00/78954 



PCT/US00/16766 



20 



He 


Asp 


Lys 


Lys 


He 
35 


Ser 


Arg 


Leu 


Lys 


Asp 


Gin 


He 


Lys 
50 


Lys 


Met 


Arg 


Val 


Lys 


Gin 


Lys 


Ala 
65 


Leu 


Arg 


Val 


Glu 


Gin 


Gin 


Arg 


Asp 
80 


Asn 


Leu 


Ala 


Gin 


Ala 


Asn 


Tyr 


Thr 
95 


He 


Gin 


Ser 


Val 


Asp 


Ala 


Met 


Lys 
110 


Leu 


Gly 


Val 


Lys 


Gin 


Val 


Lys 


lie 
125 


Asp 


Gin 


He 


Glu 


Asp 


Met 


Met 


Glu 
140 


Asp 


Ala 


Asn 


Arg 


Ser 


Tyr 


Gly 


Thr 
155 


Pro 


Glu 


Leu 


Glu 


Leu 


Asp 


Ala 


Leu 
170 


Gly 


Asp 


Glu 


Ser 


Tyr 


Leu 


Asp 


Glu 
185 


Ala 


Ala 


Ser 


Val 


Pro 


Thr 


Asp 


Thr 
200 


Lys 


Asn 


Lys 


Phe 


Gly 


Leu 


Pro 


Gin 


He 


Pro 


Ala 



215 

<210> 13 
<211> 142 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 1986873CD1 

<400> 13 



Met 


Asp 


Met 


Thr 


Ser 


Pro 


Glu 


Gin 


1 








5 








Thr 


Glu 


Glu 


Lys 


Glu 
20 


Ala 


Phe 


He 


Tyr 


Met 


Lys 


Arg 


Gly 
35 


Lys 


Gly 


Lys 


Cys 


Arg 


Ala 


Met 


Lys 
50 


Lys 


Gly 


Ala 


Lys 


His 


Asn 


Val 


His 
65 


Ser 


Pro 


Tyr 


Ala 


Phe 


Leu 


Leu 


Glu 
80 


Ser 


Leu 


Leu 


Gly 


Gin 


Ser 


Leu 


Leu 
95 


Lys 


Cys 


Pro 


Phe 


Pro 


Arg 


Gly 


Phe 
110 


Lys 


Lys 


His 


His 


Asn 


Glu 


Glu 


Ala 
125 


Asn 


Lys 


Lys 


Pro 


Leu 


Glu 


Glu 


Gin 
140 


Gin 


lie 





<210> 14 
<211> 524 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 2010820CD1 





25 










30 


Asp 


Ala 
40 


Glu 


Leu 


Val 


Lys 


Tyr 
45 


Glu 


Gly 
55 


Pro 


Ala 


Lys 


Asn 


Met 
60 


Leu 


Lys 
70 


Gin 


Lys 


Arg 


Met 


Tyr 
75 


Gin 


Gin 
85 


Ser 


Phe 


Asn 


Met 


Glu 
90 


Leu 


Lys 
100 


Asp 


Thr 


Lys 


Thr 


Thr 
105 


Lys 


Glu 
115 


Met 


Lys 


Lys 


Ala 


Tyr 
120 


Glu 


Asp 
130 


Leu 


Gin 


Asp 


Gin 


Leu 
135 


Glu 


He 
145 


Gin 


Glu 


Ala 


Leu 


Ser 
150 


Asp 


Glu 
160 


Asp 


Asp 


Leu 


Glu 


Ala 
165 


Leu 


Leu 
175 


Ala 


Asp 


Glu 


Asp 


Ser 
180 


Ala 


Pro 
190 


Ala 


He 


Pro 


Glu 


Gly 
195 


Asp 


Gly 
205 


Val 


Leu 


Val 


Asp 


Glu 
210 


Ser 















Ser 


Arg 


Asn 


Val 


Leu 


Gin 


Phe 




10 










15 


Ser 


Glu 


Glu 


Glu 


He 


Ala 


Lys 




25 










30 


Tyr 


Tyr 


Cys 


Lys 


He 


Cys 


Cys 




40 










45 


Val 


Leu 


His 


His 


Leu 


Val 


Asn 




55 










60 


Lys 


Cys 


Thr 


He 


Cys 


Gly 


Lys 




70 










75 


Lys 


Asn 


His 


Val 


Ala 


Ala 


His 




85 










90 


Arg 


Cys 


Asn 


Phe 


Glu 


Ser 


Asn 




100 










105 


Leu 


Thr 


His 


Cys 


Gin 


Ser 


Arg 




115 










120 


Leu 


Met 


Glu 


Ala 


Leu 


Glu 


Pro 




130 










135 
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<400> 14 



Met 


Ala 


Glu 


He 


Lys 


Val 


Lys 


Leu 


1 








5 








Glu 


Asn 


Cvs 


He 


Thr 


Leu 


Gin 


Asp 










20 








LVS 


Glu 


Glu 


He. 


Lys 


Ala 


Leu 


Glu 








35 








Lys 


Glu 


Thr 


Glu 


Gin 


Leu 


Glu 


He 








50 








Asn 


Asp 


Ala 


Glu 


Thr 


Leu 


Gin 


Lvs 










65 








Leu 


Leu 


Lvs 


Gin 


Met 


Ser 


He 


Ser 










80 








Asn Gly 


He 


He 


Glu 


Ser 


Leu 


He 










95 








His 


Pro 


Val 


Val 


Arg 


Asn 


Leu 


Ala 










110 












Gin 


Asn 


Gin 


Asp 


Phe 


Ala 










12 5 








Leu 


ri 1 n 


Val 


Leu 


Gin 


He 


Asp 


Asp 










140 








Jj€U 


T A/O 


Ala 


He 


Phe 


ASP 


Gin 


Leu 








155 










T \ro 

ljjr' i> 


Thr 


Lvs 


Lvs 


He 


Lys 


Thr 










170 








Tip 

lie 


Asn 


Ser 


Asn 


Asp 


Glu 


Gin 


Glu 










185 








Ala 




Ala 


Lys 


Asn 


Val 


Leu 


Lvs 










200 








Ser 


Pin 
1 U 


Val 


Ser 


Glu 


Leu 


Arcr 


Thr 










215 








Lys 


Leu 


Met 


Phe 


Ser 


Glv 


Leu 


Leu 










230 








AX" 5 


J_j tr L-l 


lie 


Leu 


Leu 


Tro 


Tvr 


Asn 










245 








Gin 


Leu 


Arg 


His 


Cvs 


Leu 


Glv 


Val 










260 








Ala 


Ser 


Arg 


Thr 


Asn 


Gin 


Glu 


Cvs 










275 








Thr 


Leu 


Gin 


Thr 


Leu 


Ala 


Asn 


Ala 










290 








Glu 


He 


Asp 


He 


Thr 


Asn 


Val 


Ala 










305 








Arg 


Pro 


Ser 


Glv 


Leu 


Asn 


Pro 


Gin 










320 








Gin 


Ala 


Leu 


Thr 


Val 


His 


Asp 


Asn 










335 








Glu 


He 


Leu 


Thr 


Ser 


Pro 


Cys 


Ser 










350 








Lys 


Ala 


Leu 


Ser 


Ser 


Leu 


Glu 


Leu 








365 








Leu 


Leu 


Val 


Leu 


Leu 


Asn 


Glu 


He 










380 








Thr 


Cys 


Leu 


Arq 


Ala 


Leu 


Glu 


Lvs 










395 








Gly Asn 


Lvs 


Glu 


Phe 


Glv 


Asp 


Gin 










410 








Thr 


Leu 


Thr 


Thr 


Thr 


Thr 


Phe 


Gin 










425 








Glu 


Val 


Tyr 


Met 


Thr 


Pro 


Leu 


Arg 










440 








Ser 


Lys 


Ser 


Thr 


Gin 


Leu 


Lys 


Thr 










455 








Thr 


Val 


Ser 


Ala 


Arg 


Thr 


Asn 


Arg 










470 








Asp 


Ser 


Glu 


Ser 


Asp 


His 


Glu 


Val 



485 
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He 


Glu 


Ala 


LVS 


Glu 


Ala 


Leu 




10 










15 


Phe 


Asn 


Arcr 


Ala 


Ser 


Glu 


Leu 




25 










30 


Asp 


Ala 


Arg 


He 


Asn 


Leu 


Leu 




40 










45 


Lys 


Glu 


Val 


His 


He 


Glu 


Lys 




55 










60 


Cys 


Leu 


He 


Leu 


Cys 


Tyr 


Glu 




70 










75 


Thr 


Glv 


Leu 


Ser 


Ala 


Thr 


Met 




85 










90 


Leu 


Pro 


Glv 


He 


He 


Ser 


He 




100 










105 


Val 


Leu 


Cvs 


Leu 


Gly Cys 


Cvs 




115 










120 


Arq 


Lvs 


His 


Phe 


val 


Lieu 


Leu 




130 










135 


Val 


Thr 


He 


Lvs 


Tl <=> 
lie 


ser 


Ala 




145 










150 


Met 


Thr 


Phe 


Glv 


11c 


UlU 


Pro 




160 










165 


Leu 


His 


Cys 


Glu 


n:l \/ 
uiy 


i nx. 


Glu 




175 










180 


Ser 


Lys 


Glu 


Val 


VjX Li 


Ol Li 


Thr 




190 










195 


Leu 


Leu 


Ser 


Asp 


iriie 


Leu 


Asp 




205 










210 


Glv 


Ala 


Ala 


Glu 


uiy 


Leu 


Ala 




220 










225 


Val 


Ser 


Ser 


Arg 


lie 


Leu 


Ser 




23 5 










240 


Pro 


Val 


Thr 


Glu 


UlU 


ASp 


Val 




250 










255 


Phe 


Phe 


Pro 


Val 


Phe 


Ala 


Tvr 




265 










270 


Phe 


Glu 


Glu 


Ala 


Phe 


Leu 


Pro 




280 










285 


Pro 


Ala 


Ser 


Ser 


Pro 


Leu 


Ala 




295 










300 


Glu 


Leu 


Leu 


Val 


Asp 


Leu 


Thr 




310 










315 


Ala 


Lys 


Thr 


Ser 






Tvr 




325 










330 


Leu 


Ala 


Met 


Lys 


He 


Cys 


Asn 




340 










345 


Pro 


Glu 


He 


Ara 


Val 


Tyr 


Thr 




355 










360 


Ser 


Ser 


His 


Leu 


Ala 


Lys 


Asp 




370 










375 


Leu 


Glu 


Gin 


Val 


Lys 


Asp 


Arq 




385 










390 


He 


Lvs 


He 


Gin 


Leu 


Glu 


Lys 




400 










405 


Ala 


Glu 


Ala 


Ala 


Gin 


Asp 


Ala 




415 










420 


Asn 


Glu 


Asp 


Glu 


Lys 


Asn 


Lvs 




430 










435 


Gly 


Val 


Lys 


Ala 


Thr 


Gin 


Ala 




445 










450 


Asn 


Arg 


Gly 


Gin 


Arg 


Lys 


Val 




460 










465 


Arg 


Cys 


Gin 


Thr 


Ala 


Glu 


Ala 




475 










480 


Pro 


Glu 


Pro 


Glu 


Ser 


Glu 


Met 




490 










495 
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Lys Met Arg Leu Pro 
500 

Ser Lys Leu Asn Leu 
515 



Arg Arg Ala Lys Thr 
505 

Ala Gin Phe Leu Asn 
520 



Ala Ala Leu Glu Lys 
510 

Glu Asp Leu Ser 



<210> 15 

<211> 500 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 2013818CD1 



<400> 15 



Met 


Pro 


Gly 


Gin 


Ser 


val 


Arg 


1 








5 






He 


Ser 


Glu 


Ala 


Ser 
20 


Glu 


Asn 


Ser 


Gin 


Asn 


Gin 


Gin 
35 


He 


Pro 


Gly 


Arg 


Arg 


Lys 


Lys 
50 


Glu 


Val 


Ser 


Ser 


Val 


Glu 


Gin 
65 


Glu 


Leu 


Lys 


Arg 


Leu 


Lys 


Ser 
80 


Ser 


Gin 


Thr 


Thr 


Lys 


Lys 


Glu 
95 


Val 


Lys 


Pro 


Arg 


Arg 


He 


Lys 
110 


Arg 


Ser 


He 


He 


Asn 


Asp 


Leu 
125 


Lys 


Val 


Met 


He 


Arg 


Lys 


Leu 
140 


Arg 


Ser 


Thr 


Gly 


Asn 


Lys 


Gin 
155 


Asp 


Asp 


He 


Lys 


His 


Val 


Arg 
170 


Arg 


Val 


Asp 


Val 


Arg 


Glu 


Asp 
185 


Ser 


Asn 


Gin 


Ala 


Glu 


Phe 


Asp 
200 


Met 


Ser 


Pro 


Arg 


Lys 


He 


Asn 
215 


Pro 


Ser 


Lys 


Glu 


Glu 


Arg 


Ser 
230 


Pro 


Lys 


Arg 


Arg 


Ser 


Thr 


Arg 
245 


Asn 


Thr 


Val 


Gly 


Lys 


Pro 


Ala 

260 


Leu 


Gly 


Glu 


Leu 


Ser 


Met 


Val 
275 


Met 


Ser 


Thr 


Glu 


Ser 


Gin 


Ser 
290 


Gin 


Lys 


Glu 


Arg 


Thr 


Asp 


Glu 
305 


Met 


Thr 


Glu 


Arg 


Leu 


Leu 


Ala 
320 


Thr 


Ala 


Ser 


Arg 


Thr 


Arg 


Ser 
335 


Ser 


Lys 


Glu 


Pro 


Asn 


Asn 


Glu 
350 


Pro 


Leu 


Arg 


Lys 


Ala 


Lys 


Ala 
365 


Lys 


Lys 


Glu 


Leu 


Val 


Ser 


Asp 


Leu 


Ser 



380 



Lys 


Lys 


Thr 


Arg 


Lys 


Ala 


Lys 


Glu 






10 










15 


He 


Tyr 


Ser 


Asp 


Val 


Arg 


Gly 


Leu 






25 










30 


Gin 


Asn 


Ser 


Val 


Thr 


Pro 


Arg 


Arg 






40 










45 


Asn 


Gin 


Asp 


He 


Leu 


Glu 


Asn 


Thr 






55 










60 


Gin 


He 


Thr 


Thr 


Gly 


Arg 


Glu 


Ser 






70 










75 


Leu 


Leu 


Glu 


Pro 


Ala 


Val 


Glu 


Glu 






85 










90 


Val 


Ser 


Ser 


Val 


Thr 


Lys 


Arg 


Thr 






100 










105 


Val 


Glu 


Asn 


Gin 


Glu 


Ser 


Val 


Glu 






115 










120 


Ser 


Thr 


Val 


Thr 


Ser 


Pro 


Ser 


Arg 






130 










135 


Thr 


Asn 


Leu 


Asp 


Ala 


Ser 


Glu 


Asn 






145 










150 


Lys 


Ser 


Ser 


Asp 


Lys 


Gin 


Leu 


Arg 






160 










165 


Arg 


Gly 


Arg 


Glu 


Val 


Ser 


Pro 


Ser 






175 










180 


Leu 


Glu 


Ser 


Ser 


Gin 


Leu 


Thr 


Val 






190 










195 


Ala 


He 


Pro 


Arg 


Lys 


Arg 


Gly 


Arg 






205 










210 


Glu 


Asp 


Val 


Gly 


Ser 


Lys 


Ala 


Val 






220 










225 


Lys 


Lys 


Glu 


Ala 


Pro 


Ser 


He 


Arg 






235 










240 


Pro 


Ala 


Lys 


Ser 


Glu 


Asn 


Val 


Asp 






250 










255 


Lys 


Ser 


He 


Leu 


Val 


Pro 


Asn 


Glu 






265 










270 


Ser 


Lys 


Lys 


Lys 


Leu 


Thr 


Lys 


Lys 






280 










285 


Arg 


Ser 


Leu 


His 


Ser 


Val 


Ser 


Glu 






295 










300 


His 


Lys 


Glu 


Thr 


Asn 


Glu 


Gin 


Glu 






310 










315 


Ser 


Phe 


Thr 


Lys 


Ser 


Ser 


Arg 


Ser 






325 










330 


Ala 


He 


Leu 


Leu 


Pro 


Asp 


Leu 


Ser 






340 










345 


Phe 


Ser 


Pro 


Ala 


Ser 


Glu 


Val 


Pro 






355 










360 


He 


Glu 


Val 


Pro 


Ala 


Gin 


Leu 


Lys 






370 










375 


Ser 


Gin 


Phe 


Val 


He 


Ser 


Pro 


Pro 






385 










390 
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Ala 


Leu 


Arg 


Ser 


Arg 
395 


Gin 


Lys 


Glu 


Asp 


Glu 


Leu 


Lys 
410 


Asp 


Asp 


Lys 


Pro 


Lys 


Ala 


Lys 
425 


Arg 


He 


Ser 


Lys 


Asn 


Thr 


Glu 
440 


Lys 


Glu 


Glu 


He 


Arg 


Leu 


He 
455 


Ser 


Pro 


Lys 


Ser 


Lys 


Pro 


Arg 
470 


Lys 


Thr 


Gly 


Arg 


Asn 


Arg 


Lys 
485 


Lys 


Leu 


Arg 


Arg 


Lys 


Met 


Leu 
500 







Asn 


Thr 


Ser 


Asn 


LVS 


Asn 


Lys 


Leu 






400 










405 


Ala 


Gin 


Ser 


Val 


Glu 


Thr 


Leu 


Gly 






415 










420 


Arg 


Thr 


Ser 


Lys 


Thr 


Lys 


Gin 


Ala 






430 










435 


Ser 


Ala 


Trn 


Ser 


Pro 


Pro 


Pro 


He 






445 










450 


Leu 


Ala 


Ser 


Pro 


Ala 


Asp 


Gly Val 






460 










465 


Thr 


Glu 


Val 


Thr 


Gly 


Thr 


Gly 


Leu 






475 










480 


Ser 


Ser 


Tyr 


Pro 


Lys 


Gin 


He 


Leu 






490 










495 



<210> 16 

<211> 119 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 2302032CD1 



<400> 16 










Met 


Asn 


Ala 


Ser 


Ser 


Glu 


Gly 


1 








5 






He 


Pro 


Gly 


Gly 


Thr 
20 


Thr 


Val 


His 


He 


Cys 


Gly 


He 
35 


Cys 


Lys 


Phe 


Val 


Ala 


His 


Lys 
50 


Gin 


Ser 


Ala 


Ala 


Ala 


Pro 


Ser 
65 


Thr 


Val 


Pro 


Ala 


Thr 


Gin 


Thr 
80 


Gin 


Thr 


Thr 


Gin 


Thr 


He 


Thr 
95 


Gly 


Thr 


Glu 


Leu 


Ala 


Trp 


Leu 
110 


Cys 


Leu 



Glu 


Ser 


Phe 


Ala 


Gly 


Ser 


Val 


Gin 






10 










15 


Leu 


Val 


Glu 


Leu 


Thr 


Pro 


Asp 


He 






25 










30 


Gin 


Gin 


Phe 


Asn 


Asn 


Leu 


Asp 


Ala 






40 










45 


Gly 


Cys 


Gin 


Leu 


Thr 


Gly 


Thr 


Ser 






55 










60 


Gin 


Phe 


Val 


Ser 


Glu 


Glu 


Thr 


Val 






70 










75 


Thr 


Thr 


Arg 


Thr 


He 


Thr 


Ser 


Glu 






85 










90 


Ala 


Gly 


Ala 


Trp 


Gly 


Ser 


Arg 


Pro 






100 










105 


Lys 


His 


Val 


His 


Gly 


Thr 


Cys 








115 













<210> 17 

<211> 544 

<212> PRT 

<213> Homo sapiens 

<220> 

<2 21> mi sc_f eature 

<223> Incyte ID No: 2326109CD1 



<400> 17 



Met 


He 


His 


Val 


Arg 


Arg 


His 


Glu 


Thr 


Arg 


Arg 


Asn 


Ser 


Lys 


Ser 


1 








5 










10 










15 


His 


Val 


Pro 


Glu 


Gin 


Lys 


Ser 


Arg 


Val 


Asp 


Trp 


Arg 


Arg 


Thr 


Lys 










20 










25 










30 


Arg 


Ser 


Ser 


lie 


Ser 


Gin 


Leu 


Leu 


Asp 


Ser 


Asp 


Glu 


Glu 


Leu 


Asp 








35 










40 










45 


Ser 


Glu 


Glu 


Phe 


Asp 


Ser 


Asp 


Glu 


Glu 


Leu 


Asp 


Ser 


Asp 


Glu 


Ser 










50 










55 










60 


Phe 


Glu 


Asn 


Asp 


Glu 


Glu 


Leu 


Asp 


Ser 


Asn 


Lys 


Gly 


Pro 


Asp 


Cys 










65 










70 










75 


Asn 


Lys 


Thr 


Pro 


Gly 


Ser 


Glu 


Arg 


Glu 


Leu 


Asn 


Leu 


Ser 


Lys 


He 








80 










85 










90 


Gin 


Ser 


Glu 


Gly Asn 


Asp 


Ser 


Lys 


Cys 


Leu 


He 


Asn 


Ser 


Gly 


Asn 
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95 






Gly 


Ser 


Thr 


Tyr 


Glu 
110 


Glu 


Glu 


He 


Asp 


Leu 


Gin 


Asp 
125 


Gin 


Glu 


Asp 


Leu 


Asn 


Lys 


Gin 
140 


Thr 


Gly 


Lys 


His 
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Phe 


Pro 
35 


He 


Pro 


Pro 


Gin 


Asn 


Leu 


Phe 


Asp 


Gly 
50 


Pro 


Glu 


Ser 


Gin 


Leu 


Val 


Ser 


Leu 


Leu 
65 


Gly 


Asp 


Gin 


Arg 


Asn 


Pro 


Met 


Gin 


Thr 
80 


Ser 


Ser 


Ser 


Val 


His 


Leu 


His 


Ser 


Phe 
95 


Ser 


Met 


Met 


Pro 


Met 


Val 


Gly 


His 


Glu 
110 


Met 


Ala 


Ser 


Asp 


Pro 


Phe 


Ser 


Asn 


Met 


Gly Asn 


Pro 


Met 










125 










Lys 


Ser 


Leu 


Phe 


Gin 
140 


Trp 


Gin 


Val 


Glu 


Ala 


Asn 


He 


Ser 


Gin 


Asp 


Gin 


Phe 


Leu 










155 










Asp 


Thr 


Phe 


Leu 


His 
170 


lie 


Ala 


Val 


Ala 


Ser 


Tyr. 


Val 


Leu 


Ala 
185 


Arg 


Lys 


Met 


Asn 


He 


Lys 


Glu 


His 


Asn 


Gly Gin 


Ser 


Ala 










200 










Ala 


Asn 


Gin 


His 


Leu 
215 


He 


Val 


Gin 


Asp 


Gin 


Val 


Asn 


Thr 


Thr 
230 


Asp 


Cys 


Trp 


Gly 


Cys 


Ala 


Glu 


Lys 


Gly 
245 


His 


Ser 


Gin 


Val 


Gly 


Ala 


Val 


Gly 


Ser 
260 


Asn 


Gin 


Phe 


Val 


Tyr 


Asp 


Gly 


Leu 


Thr 
275 


Pro 


Leu 


His 


Cys 


Ala 


Val 


Val 


His 


Glu 
290 


Leu 


Gin 


Arg 


Asn 


Glu 


Val 


Gin 


Glu 


Leu 
305 


Leu 


Leu 


Lys 


Asn 


He 


Lys 


Cys 


Leu 


He 
320 


Gin 


Met 


Gly 


Ala 


Tyr 


Asn 


Gly 


Asn 


Thr 
335 


Ala 


Leu 


His 


Val 


Arg 


Leu 


Thr 


Gin 


Leu 
350 


Asp 


Ala 


Val 


Arg 


Ala 


Asp 


Pro 


Ser 


Thr 
365 


Arg 


Asn 


Leu 


Glu 


Leu 


Val 


Pro 


Asp 


Gly 
380 


Pro 


Val 


Gly 


Glu 


Lys 


Gly 


Lys 


Ser 


He 
395 


Gin 


Gin 


Arg 


Ala 
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<211> 254 
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Tyr 


Ser 


Pro 


Gin 


Asn 


Gin 


10 










15 


His 


Lys 


Pro 


Thr 


Leu 


Glu 


25 










30 


Ser 


Pro 


Ala 


Tyr 


Glu 


Pro 


40 










45 


Phe 


Cys 


Pro 


Asn 


Gin 


Ser 


55 










60 


Glu 


Ser 


Glu 


Asn 


He 


Ala 


70 










75 


Gin 


Gin 


Gin 


Asn 


Asp 


Ala 


85 










90 


Ser 


Ser 


Ala 


Cys 


Glu 


Ala 


100 










105 


Ser 


Ser 


Asn 


Thr 


Ser 


Leu 


115 










120 


Asn 


Thr 


Thr 


Gin 


Leu 


Gly 


130 










135 


Gin 


Glu 


Glu 


Ser 


Lys 


Leu 


145 










15 0 


Ser 


Lys 


Asp 


Ala 


Asp 


Gly 


160 










165 


Gin 


Gly Arg 


Arg 


Ala 


Leu 


175 










180 


Ala 


Leu 


His 


Met 


Leu 


Asp 


190 










195 


Phe 


Gin 


Val 


Ala 


Val 


Ala 


205 










210 


Leu 


Val 


Asn 


He 


Gly 


Ala 


220 










225 


Arg 


Thr 


Pro 


Leu 


His 


Val 


235 










240 


Leu 


Gin 


Ala 


He 


Gin 


Lys 


250 










255 


Asp 


Leu 


Glu 


Ala 


Thr 


Asn 


265 










270 


Ala 


Val 


He 


Ala 


His 


Asn 


280 










285 


Gin 


Gin 


Pro 


His 


Ser 


Pro 


295 










300 


Lys 


Ser 


Leu 


Val 


Asp 


Thr 


310 










315 


Ala 


Val 


Glu 


Ala 


Lys 


Ala 


325 










330 


Ala 


Ala 


Ser 


Leu 


Gin 


Tyr 


340 










345 


Leu 


Leu 


Met 


Arg 


Lys 


Gly 


355 










360 


Asn 


Glu 


Gin 


Pro 


Val 


His 


370 










375 


Gin 


He 


Arg 


Arg 


He 


Leu 


385 










390 


Pro 


Pro 


Tyr 









400 
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<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 2637522CD1 



<400> 22 



Met 


Asn 


Phe 


Thr 


Val 


Gly 


Phe 


Lys 


Pro 


Leu 


Leu 


Gly 


Asp 


Ala 


His 


1 








5 










10 










15 


Ser 


Met 


Asp 


Asn 


Leu 


Glu 


Lys 


Gin 


Leu 


He 


Cys 


Pro 


He 


Cys 


Leu 








20 










25 










30 


Glu 


Met 


Phe 


Ser 


Lys 


Pro 


Val 


Val 


He 


Leu 


Pro 


Cys 


Gin 


His 


Asn 










35 










40 










45 


Leu 


Cys 


Arg 


Lys 


Cys 


Ala 


Asn 


Asp 


Val 


Phe 


Gin 


Ala 


Ser 


Asn 


Pro 








50 










55 










60 


Leu 


Trp 


Gin 


Ser 


Arg 


Gly 


Ser 


Thr 


Thr 


Val 


Ser 


Ser 


Gly 


Gly 


Arg 








65 










70 










75 


Phe 


Arg 


Cys 


Pro 


Ser 


Cvs 


Arg 


His 


Glu 


Val 


Val 


Leu 


Asp 


Arg 


His 






80 










85 










90 


Gly Val 


Tyr 


Gly 


Leu 


Gin 


Arg 


Asn 


Leu 


Leu 


Val 


Glu 


Asn 


He 


He 








95 










100 










105 


Asp 


He 


Tyr 


Lys 


Gin 


Glu 


Ser 


Ser 


Arg 


Pro 


Leu 


His 


Ser 


Lys 


Ala 






110 










115 










120 


Glu 


Gin 


His 


Leu 


Met 


Cvs 


Glu 


Glu 


His 


Glu 


Glu 


Glu 


Lys 


He 


Asn 










12 5 










130 










135 


He 


Tyr 


Cys 


Leu 


Ser 


Cys 


Glu 


Val 


Pro 


Thr 


Cvs 


Ser 


Leu 


Cys 


Lys 






140 










145 










150 


Val 


Phe 


Gly Ala 


His 


Lys 


Asp 


Cys 


Glu 


Val 


Ala 


Pro 


Leu 


Pro 


Thr 










155 










160 










165 


He 


Tyr 


Lys 


Arg 


Gin 


Lys 


Ser 


Glu 


Leu 


Ser 


Asp 


Gly 


He 


Ala 


Met 










170 










175 










180 


Leu 


Val 


Ala 


Gly 


Asn 


Asp 


Arg 


Val 


Gin 


Ala 


Val 


He 


Thr 


Gin 


Met 










185 










190 










195 


Glu 


Glu 


Val 


Cys 


Gin 


Thr 


He 


Glu 


Asp 


Asn 


Ser 


Arg 


Arg 


Gin 


Lys 








200 










205 










210 


Gin 


Leu 


Leu 


Asn 


Gin 


Arg 


Phe 


Glu 


Ser 


Leu 


Cys 


Ala 


Val 


Leu 


Glu 










215 










220 










225 


Glu 


Arg 


Asn 


Gly 


Glu 


Leu 


Leu 


Gin 


Ala 


Leu 


Ala 


Arg 


Glu 


Gin 


Ala 








230 










235 










240 


Gly 


Gin 


Ala 


Ser 


Thr 


Arg 


Ser 


Asp 


Gly 


Thr 


His 


Ser 


Gly 


Gin 










245 










250 
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<221> misc_feature 

<223> Incyte ID No: 2650980CD1 

<400> 23 



Met 


Ala 


Thr 


Asp 


Thr 


Ser 


Gin 


Gly 


Glu 


Leu 


Val 


His 


Pro 


Lys 


Ala 


1 






5 










10 










15 


Leu 


Pro 


Leu 


He 


Val 


Gly Ala 


Gin 


Leu 


He 


His 


Ala 


Asp 


Lys 


Leu 










20 










25 










30 


Gly 


Glu 


Lys 


Val 


Glu 


Asp 


Ser 


Thr 


Met 


Pro 


He 


Arg 


Arg 


Thr 


Val 






35 










40 










45 


Asn 


Ser 


Thr 


Arg 


Glu 
50 


Thr 


Pro 


Pro 


Lys 


Ser 
55 


Lys 


Leu 


Ala 


Glu 


Gly 
60 


Glu 


Glu 


Glu 


Lys 


Pro 


Glu 


Pro 


Asp 


He 


Ser 


Ser 


Glu 


Glu 


Ser 


Val 








65 










70 










75 


Ser 


Thr 


Val 


Glu 


Glu 
80 


Gin 


Glu 


Asn 


Glu 


Thr 
85 


Pro 


Pro 


Ala 


Thr 


Ser 
90 


Ser 


Glu 


Ala 


Glu 


Gin 
95 


Pro 


Lys 


Gly 


Glu 


Pro 
100 


Glu 


Asn 


Glu 


Glu 


Lys 
105 
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Glu 


Glu 


Asn 


Lys 


Ser 


Ser 


Glu 


Glu 


Thr 


Lys 


Lys 


Asp 


Glu 


Lvs 


Asp 










110 










115 










120 


Gin 


Ser 


Lvs 


Glu 


Lvs 


Glu 


Lvs 


Lvs 


Val 


Lys 


Lys 


Thr 


lie 


Pro 


Ser 










125 










130 










135 


Trp 


Ala 


Thr 


Leu 


Ser 


Ala 


Ser 


Gin 


Leu 


Ala 


Ara 


Ala 


Gin 


Lys 


Gin 










140 










145 










150 


Thr 


Pro 


Met 


Ala 


Ser 


Ser 


Pro 


Arg 


Pro 


Lys 


Met 


Asp 


Ala 


lie 


Leu 










155 










160 










165 


Thr 


Glu 


Ala 


lie 


Lys 


Ala 


Cys 


Phe 


Gin 


Lvs 


Ser 


Gly 


Ala 


Ser 


Val 










170 










175 










180 


Val 


Ala 


lie 


Ara 


Lys 


Tyr 


lie 


lie 


His 


Lvs 


Tvr 


Pro 


Ser 


Leu 


Glu 










185 










190 










195 


Leu 


Glu 


Ara 


Ara 


Gly 


Tyr 


Leu 


Leu 


Lys 


Gin 


Ala 


Leu 


Lys 


Arg 


Glu 










200 










205 










210 


Leu 


Asn 


Arg 


Glv 


Val 


lie 


Lys 


Gin 


Val 


Lys 


Glv 


Lvs 


Glv 


Ala 


Ser 










215 










220 










225 


Glv 


Ser 


Phe 


Val 


Val 


Val 


Gin 


Lvs 


Ser 


Arg 


Lvs 


Thr 


Pro 


Gin 


Lys 










230 










235 










240 


Ser 


Ara 

-rax ^ 


Asn 


Arg 


Lvs 


Asn 


Arg 


Ser 


Ser 


Ala 


Val 


Asp 


Pro 


Glu 


Pro 










245 










250 










255 


Gin 


Val 


Lys 


Leu 


Glu 


Asp 


Val 


Leu 


Pro 


Leu 


Ala 


Phe 


Thr 


Arg 


Leu 








260 










2 65 










270 


Cvs 


Glu 


Pro 


Lys 


Glu 


Ala 


Ser 


Tvr 


Ser 


Leu 


lie 


Ara 


Lvs 


Tvr 


Val 










275 










280 










285 


Ser 


Gin 


Tvr 


Tvr 


Pro 


Lys 


Leu 


Arg 


Val 


Asp 


lie 


Ara 


Pro 


Gin 


Leu 










290 










2 95 










300 


Leu 


Lys 


Asn 


Ala 


Leu 


Gin 


Arg 


Ala 


Val 


Glu 


Arg 


Glv 


Gin 


Leu 


Glu 










305 










310 










315 


Gin 


lie 


Thr 


Glv 


Lys 


Gly Ala 


Ser 


Gly 


Thr 


Phe 


Gin 


Leu 


Lys 


Lys 










32 0 










325 










330 


Ser 


Gly 


Glu 


Lys 


Pro 


Leu 


Leu 


Gly Gly 


Ser 


Leu 


Met 


Glu 


Tvr 
j 


Ala 










335 










340 










3.4 5 


lie 


Leu 


Ser 


Ala 


lie 


Ala 


Ala 


Met 


Asn 


Glu 


Pro 


Lvs 


Thr 


Cvs 


Ser 










3 50 










3 55 










3 60 


Thr 


Thr 


Ala 


Leu 


Lys 


Lys 


Tyr 


Val 


Leu 


Glu 


Asn 


His 


Pro 


Glv 

\j j-y 


Thr 










3 6 5 










37 0 










37 5 


Asn 


- Ser 


Asn 


Tvr 


Gin 


Met 


His 


Leu 


Leu 


Lvs 


Lys 


Thr 


Leu 


Gin 


Lvs 










3 80 










3 85 










3 9 0 


Cvs 


Glu 


Lys 


Asn 


Glv 


Trp 


Met 


Glu 


Gin 


lie 


Ser 


Glv 


Lys 


Glv 


Phe 










3 95 










400 










405 


Ser 


Glv 


Thr 


Phe 


Gin 


Leu 


Cys 


Phe 


Pro 


Tvr 


Tvr 


Pro 


Ser 


Pro 


Glv 

\j j-y 










410 










415 










42 0 


Val 


Leu 


Phe 


Pro 


Lys 


Lys 


Glu 


Pro 


Asp 


Asp 


Ser 


Ara 


Asp 


Glu 


Asp 










425 










430 










435 


Glu 


Asp 


Glu 


Asp 


Glu 


Ser 


Ser 


Glu 


Glu 


Asp 


Ser 


Glu 


Asp 


Glu 


Glu 










440 










445 










450 


Pro 


Pro 


Pro 


Lys 


Ara 


Arg 


Leu 


Gin 


Lys 


Lvs 


Thr 


Pro 


Ala 


Lvs 


Ser 










455 










4 60 










465 


Pro 


Glv 


Lys 


Ala 


Ala 


Ser 


val 


Lys 


Gin 


Arg 


Glv 


Ser 


Lys 


Pro 


Ala 










470 










475 










480 


Pro 


Lys 


Val 


Ser 


Ala 


Ala 


Gin 


Arg 


Gly 


Lys 


Ala 


Ara 


Pro 


Leu 


Pro 










485 










490 










495 


Lys 


Lys 


Ala 


Pro 


Pro 


Lys 


Ala 


Lys 


Thr 


Pro 


Ala 


Lys 


Lys 


Thr 


Arg 










500 










505 










510 


Pro 


Ser 


Ser 


Thr 


Val 


lie 


Lys 


Lys 


Pro 


Ser 


Gly 


Gly 


Ser 


Ser 


Lys 










515 










520 










525 


Lys 


Pro 


Ala 


Thr 


Ser 


Ala 


Arg 


Lys 


Glu 


Val 


Lys 


Leu 


Pro 


Gly 


Lys 










530 










535 










540 


Gly 


Lys 


Ser 


Thr 


Met 


Lys 


Lys 


Ser 


Phe 


Arg 


Val 


Lys 


Lys 







545 550 
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<221> misc^feature 

<223> Incyte ID No: 2939607CD1 

<400> 24 



Met 


Asp 


Ala 


Lys 


Ser 


Leu 


Thr 


Ala 


Trp 


Ser 


Arg 


Thr 


Leu 


Val 


Thr 


1 








5 










10 










15 


Phe 


Lys 


Asp 


Val 


Phe 


Val 


Asp 


Phe 


Thr 


Arg 


Glu 


Glu 


Trp 


Lys 


Leu 








20 










25 










30 


Leu 


Asp 


Thr 


Ala 


Gin 


Gin 


He 


Val 


Tyr 


Arg 


Asn 


Val 


Met 


Leu 


Glu 








35 










40 










45 


Asn 


Tyr 


Lys 


Asn 


Leu 


Val 


Ser 


Leu 


Gly 


Tyr 


Gin 


Leu 


Thr 


Lys 


Pro 










50 










55 










60 


Asp 


Val 


He 


Leu 


Arg 


Leu 


Glu 


Lys 


Gly 


Glu 


Glu 


Pro 


Trp 


Leu 


Val 










65 










70 










75 


Glu 


Arg 


Glu 


lie 


His 


Gin 


Glu 


Thr 


His 


Pro 


Asp 


Ser 


Glu 


Thr 


Ala 










80 










85 










90 


Phe 


Glu 


He 


Lys 


Ser 


Ser 


Val 


Ser 


Ser 


Arg 


Ser 


lie 


Phe 


Lys 


Asp 










95 










100 










105 


Lvs 


Gin 


Ser 


Cys 


Asp 


He 


Lys 


Met 


Glu 


Gly 


Met 


Ala 


Arg 


Asn 


Asp 










110 










115 










120 


Leu 


Tro 


Tvr 


Leu 


Ser 


Leu 


Glu 


Glu 


Val 


Trp 


Lys 


Cys 


Arg 


Asp 


Gin 








125 










130 










135 


Leu 


Asp 


LVS 


Tvr 


Gin 


Glu 


Asn 


Pro 


Glu 


Arg 


His 


Leu 


Arg 


Gin 


Val 






140 










145 










150 


Ala 


Phe 


Thr 


Gin 


Lvs 


Lys 


Val 


Leu 


Thr 


Gin 


Glu 


Arg 


Val 


Ser 


Glu 










155 










160 










165 


Ser 


Glv 


Lvs 


Tvr 


Gly 


Gly 


Asn 


Cys 


Leu 


Leu 


Pro 


Ala 


Gin 


Leu 


Val 










170 










175 










180 


Leu 


Aro 


Glu 


Tvr 


Phe 


His 


Lys 


Arg 


Asp 


Ser 


His 


Thr 


Lys 


Ser 


Leu 










185 










190 










195 


Lys 


His 


Asp 


Leu 


Val 


Leu 


Asn 


Gly 


His 


Gin 


Asp 


Ser 


Cys 


Ala 


Ser 










200 










205 










210 


Asn 


Ser 


Asn 


Glu 


Cvs 


Glv 


Gin 


Thr 


Phe 


Cys 


Gin 


Asn 


He 


His 


Leu 










215 










220 










225 


lie 


Gin 


Phe 


Ala 


Ara 


Thr 


His 


Thr 


Glv 


Asp 


Lys 


Ser 


Tvr 


Lys 


Cys 










230 










235 










240 


Pro 


Asp 


Asn 


Asp 


Asn 


Ser 


Leu 


Thr 


His 


Gly 


Ser 


Ser 


Leu 


Gly 


He 










245 










250 










255 


Ser 


Lys 


Glv 


He 


His 


•Arg 


Glu 


Lys 


Pro 


Tyr 


Glu 


Cys 


Lys 


Glu 


Cys 










260 










265 










270 


Glv 


Lvs 


Phe 


Phe 


Ser 


Trp 


Arg 


Ser 


Asn 


Leu 


Thr 


Arg 


His 


Gin 


Leu 










275 










280 










285 


lie 


His 


Thr 


Glv 


Glu 


Lvs 


Pro 


Tyr 


Glu 


Cys 


Lys 


Glu 


Cys 


Gly 


Lys 










290 










295 










300 


Ser 


Tvr 


Ser 


Gin 


Arg 


Ser 


His 


Leu 


Val 


Val 


His 


His 


Arg 


He 


His 








305 










310 










315 


Thr 


Glv 


Leu 


LVS 


Pro 


Phe 


Glu 


Cys 


Lys 


Asp 


Cys 


Gly 


Lys 


Cys 


Phe 










320 










325 










330 


Ser* 


Ara 


Ser 


Ser 


His 


Leu 


Tyr 


Ser 


His 


Gin 


Arg 


Thr 


His 


Thr 


Gly 










335 










340 










345 


Glu 


Lys 


Pro 


Tvr 


Glu 


Cvs 


His 


Asp 


Cys 


Gly 


Lys 


Ser 


Phe 


Ser 


Gin 










350 










355 










360 


Ser 


Ser 


Ala 


Leu 


He 


Val 


His 


Gin 


Arg 


He 


His 


Thr 


Gly 


Glu 


Lys 










365 










370 










375 


Pro 


Tyr 


Glu 


Cys 


Cys 


Gin 


Cys 


Gly 


Lys 


Ala 


Phe 


He 


Arg 


Lys 


Asn 










380 










385 










390 


Asp 


Leu 


He 


Lys 


His 


Gin 


Arg 


He 


His 


Val 


Gly 


Glu 


Glu 


Thr 


Tyr 










395 










400 










405 


Lys 


Cys 


Asn 


Gin 


Cys 


Gly 


He 


He 


Phe 


Ser 


Gin 


Asn 


Ser 


Pro 


Phe 










410 










415 










420 


He 


Val 


His 


Gin 


He 


Ala 


His 


Thr 


Gly 


Glu 


Gin 


Phe 


Leu 


Thr 


Cys 










425 










430 










435 


Asn 


Gin 


Cys 


Gly 


Thr 


Ala 


Leu 


Val 


Asn 


Thr 


Ser 


Asn 


Leu 


He 


Gly 










440 










445 










450 


Tyr 


Gin 


Thr 


Asn 


His 


He 


Arg 


Glu 


Lys 


Ala 


Tyr 











455 460 
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<210> 25 
<211> 159 
<212> PRT 
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<220> 

<221> mis cofeature 

<223> Incyte ID No: 3098421CD1 

<400> 25 



Met 


Asp 


Lys 


Pro 


Arg 


Lys 


Glu 


Asn 


Glu 


Glu 


Glu 


Pro 


Gin 


Ser 


Ala 


1 








5 










10 










15 


Pro 


Lys 


Thr 


Asp 


Glu 


.Glu 


Arg 


Pro 


Pro 


Val 


Glu 


His 


Ser 


Pro 


Glu 








20 










25 










30 


Lys 


Gin 


Ser 


Pro 


Glu 


Glu 


Gin 


Ser 


Ser 


Glu 


Glu 


Gin 


Ser 


Ser 


Glu 








35 










40 










45 


Glu 


Glu 


Phe 


Phe 


Pro 


Glu 


Glu 


Leu 


Leu 


Pro 


Glu 


Leu 


Leu 


Pro 


Glu 










50 










55 










60 


Met 


Leu 


Leu 


Ser 


Glu 


Glu 


Arg 


Pro 


Pro 


Gin 


Glu 


Gly 


Leu 


Ser 


Arg 










65 










70 










75 


Lys 


Asp 


Leu 


Phe 


Glu 


Gly 


Arg 


Pro 


Pro 


Met 


Glu 


Gin 


Pro 


Pro 


Cys 










80 










85 










90 


Gly 


Val 


Gly 


Lys 


His 


Lys 


Leu 


Glu 


Glu 


Gly 


Ser 


Phe 


Lys 


Glu 


Arg 










95 










100 










105 


Leu 


Ala 


Arg 


Ser 


Arg 


Pro 


Gin 


Phe 


Arg 


Gly 


Asp 


He 


His 


Gly 


Arg 










110 










115 










120 


Asn 


Leu 


Ser 


Asn 


Glu 


Glu 


Met 


He 


Gin 


Ala 


Ala 


Asp 


Glu 


Leu 


Glu 










125 










130 










135 


Glu 


Met 


Lys 


Arg 


Val 


Arg 


Asn 


Lys 


Leu 


Met 


He 


Met 


His 


Trp 


Lys 










140 










145 










150 


Ala 


Lys 


Arg 


Ser 


Arg 


Pro 


Tyr 


Pro 


He 















155 

<210> 26 
<211> 373 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> unsure 
<222> 368 

<223> unknown, or other 
<220> 

<221> misc_feature 

<223> Incyte ID No: 3296650CD1 

<400> 26 



Met 


Lys 


Ala 


Leu 


Phe 


Lys 


His 


Glu 


Ser 


Leu 


Gly 


Ser 


Gin 


Pro 


Leu 


1 








5 










10 










15 


His 


Asp 


Arg 


val 


Leu 


Gin 


Val 


Pro 


Gly 


Leu 


Ala 


Gin 


Gly 


Gly 


Cys 










20 










25 










30 


Cys 


Arg 


Glu 


Asp 


Ala 


Met 


Val 


Ala 


Ser 


Arg 


Leu 


Thr 


Pro 


Gly 


Ser 










35 










40 










45 


Gin 


Gly 


Leu 


Leu 


Lys 


Met 


Glu 


Asp 


Val 


Ala 


Leu 


Thr 


Leu 


Thr 


Pro 










50 










55 










60 


Gly 


Trp 


Thr 


Gin 


Leu 


Asp 


Ser 


Ser 


Gin 


Val 


Asn 


Leu 


Tyr 


Arg 


Asp 










65 










70 










75 


Glu 


Lys 


Gin 


Glu 


Asn 


His 


Ser 


Ser 


Leu 


Val 


Ser 


Leu 


Gly 


Gly 


Glu 










80 










85 










90 


He 


Gin 


Thr 


Lys 


Ser 


Arg 


Asp 


Leu 


Pro 


Pro 


Val 


Lys 


Lys 


Leu 


Pro 










95 










100 










105 


Glu 


Lys 


Glu 


His 


Gly 


Lys 


He 


Cys 


His 


Leu 


Arg 


Glu 


Asp 


He 


Ala 










110 










115 










120 


Gin 


He 


Pro 


Thr 


His 


Ala 


Glu 


Ala 


Gly 


Glu 


Gin 


Glu 


Gly 


Arg 


Leu 










125 










130 










135 


Gin 


Arg 


Lys 


Gin 


Lys 


Asn 


Ala 


He 


Gly 


Ser 


Arg 


Arg 


His 


Tyr 


Cys 
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140 



His 


Glu 


Cys 


Gly 


Lys 
155 


Ser 


Phe 


Ala 


His 


Arg 


Arg 


lie 


His 
170 


Thr 


Gly 


Glu 


Cys 


Gly 


Lys 


Thr 


Phe 
185 


He 


Gly 


Ser 


Arg 


Val 


His 


Thr 


Gly 
200 


Glu 


Lys 


Pro 


Lys 


Val 


Phe 


.Ser 


His 
215 


Ser 


Ser 


Asn 


His 


Thr 


Gly Glu 


Lys 


Pro 


Tyr 


Glu 










230 








Phe 


Ser 


Gin 


Ser 


Cys 
245 


Ser 


Leu 


Leu 


Gly 


Glu 


Lys 


Pro 


Tyr 
260 


Gin 


Cys 


Asn 


Arg 


Asn 


Ser 


His 


Leu 
275 


Leu 


Arg 


His 


Asn 


Val 


Gin 


Asn 


Pro 
290 


Glu 


His 


Gly 


Arg 


Thr 


Glu 


Ser 


Gin 
305 


Trp 


Glu 


Asn 


Lys 


Cys 


Asn 


Glu 


Cys 
320 


Glu 


Arg 


Ser 


lie 


Glu 


His 


Gin 


Lys 
335 


He 


His 


Thr 


Asp 


Thr 


Cys 


Gly 


Lys 
350 


Gly 


Phe 


Thr 


His 


Gin 


Arg 


Ser 


His 


Val 


Gly 


Xaa 



365 

<210> 27 

<211> 330 

<212> PRT 

<213> Homo sapiens 

<220> 

<2 21> misc_f eature 

<223> Incyte ID No: 3687719CD1 

<400> 27 



Met 


Gin 


Gin 


Gin 


Pro 


Leu 


Pro 


Gly 


1 








5 








Pro 


Thr 


Lys 


Pro 


Pro 


Tyr 


Ser 


Tyr 










20 








He 


Gin 


Ser 


Ser 


Pro 


Gly 


Gin 


Arg 










35 








Arg 


Tyr 


He 


Met 


Gly 


Arg 


Phe 


Ala 










50 








Gly 


Trp 


Gin 


Asn 


Ser 


He 


Arg 


His 










65 








Phe 


Val 


Lys 


Val 


Pro 


Arg 


Asp 


Asp 










80 








Tyr 


Trp 


Thr 


Leu 


Asp 


Pro 


Asp 


Cys 










95 








Ser 


Phe 


Leu 


Arg 


Arg 


Arg 


Arg 


Arg 










110 








Glu 


Gly 


Thr 


Arg 


Gly 


Pro 


Ala 


Lys 










125 








Ala 


Thr 


Ser 


Gin 


Asp 


Pro 


Gly 


Val 










140 








Gin 


Cys 


Ser 


Phe 


Pro 


Pro 


Glu 


Leu 










155 








Phe 


Gly 


Gly Leu 


Val 


Gly 


Ala 


Met 










170 








Thr 


Thr 


Asp Gly Arg 


Pro 


Arg 


Pro 





145 










150 


Gin 


Ser 


Ser 


Gly 


Leu 


Thr 


Lys 




160 










165 


Lys 


Pro 


Tyr 


Glu 


Cys 


Glu 


Asp 




175 










180 


Ser 


Ala 


Leu 


Val 


He 


His 


Gin 




190 










195 


Tyr 


Glu 


Cys 


Glu 


Glu 


Cys 


Gly 




205 










210 


Leu 


He 


Lys 


His 


Gin 


Arg 


Thr 




220 










225 


Cys 


Asp 


Asp 


Cys 


Gly 


Lys 


Thr 




235 










240 


Glu 


His 


His 


Lys 


He 


His 


Thr 




250 










255 


Met 


Cys 


Gly 


Lys 


Ala 


Phe 


Arg 




265 










270 


Gin 


Arg 


He 


His 


Gly 


Asp 


Lys 




280 










285 


Glu 


Ser 


Trp 


Glu 


Ser 


Gin 


Gly 




295 










300 


Thr 


Glu 


Ala 


Pro 


Val 


Ser 


Tyr 




310 










315 


Phe 


Thr 


Arg 


Asn 


Arg 


Ser 


Leu 




325 










330 


Gly Asp 


Lys 


Pro 


Tyr 


Gin 


Cys 




340 










345 




Thr 


Ser 


Tyr 


Leu 


Val 


Gin 




355 










360 


Lys 


Thr 


Leu 


Ser 


Gin 








370 












Pro 


Gly 


Ala 


Pro 


Thr 


Thr 


Glu 




10 










15 


lie 


Ala 


Leu 


He 


Ala 


Met 


Ala 




25 










30 


Ala 


Thr 


Leu 


Ser 


Gly 


He 


Tyr 




40 










45 


Phe 


Tyr 


Arg 


His 


Asn 


Arg 


Pro 




55 










60 


Asn 


Leu 


Ser 


Leu 


Asn 


Glu 


Cys 




70 










75 


Arg 


Lys 


Pro 


Gly 


Lys 


Gly 


Ser 




85 










90 


His 


Asp 


Met 


Phe 


Glu 


His 


Gly 




100 










105 


Phe 


Thr 


Arg 


Gin 


Thr 


Gly 


Ala 




115 










120 


Ala 


Arg 


Arg 


Gly 


Pro 


Leu 


Arg 




130 










135 


Pro 


Asn 


Ala 


Thr 


Thr 


Gly 


Arg 




145 










150 


Pro 


Asp 


Pro 


Lys 


Gly 


Leu 


Ser 




160 










165 


Pro 


Ala 


Ser 


Met 


Cys 


Pro 


Ala 




175 










180 


Pro 


Met 


Glu 


Pro 


Lys 


Glu 


He 
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185 








Ser 


Thr 


Pro 


Lys 


Pro 
200 


Ala 


Cys 


Pro 


Ser 


Ser 


Ser 


Ser 


Cys 
215 


Pro 


Ala 


Phe 


Glu 


Ala 


Glu 


Ser 


Phe 
230 


Asn 


Lys 


Ala 


Glu 


Ser 


Gly 


He 


Gly 
245 


Ser 


Ser 


Tyr 


Asn 


Phe 


Cys 


Met 


Gly 
260 


Ala 


Asp 


Pro 


Ser 


Ala 


Ala 


Pro 


Ser 
275 


Pro 


Ala 


Pro 


Arg 


Ala 


Pro 


Leu 


Pro 
290 


Leu 


Pro 


Thr 


Ala 


Gly 


Gly 


Phe 


Pro 
305 


Val 


Gin 


Gly 


Leu 


Thr 


Pro 


Cys 


Leu 
320 


Tyr 


Arg 


Thr 



<210> 28 
<211> 396 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 3774188CD1 

<400> 28 



Met 


Lys 


Ala 


Val 


Lys 


Ser 


Glu 


Arg 


1 








5 








His 


Arg 


Asp 


Gly 


Asp 
20 


Val 


Val 


Leu 


Gin 


Glu 


Arg 


Leu 


Ser 
35 


Pro 


Glu 


Val 


Pro. 


Asp 


His 


Ser 


Gly 
50 


Gly 


Ser 


Pro 


Ala 


Arg 


Ser 


Gly 


His 


Arg 


Gly Asn 










65 








Ser 


Pro 


Pro 


Lys 


Lys 
80 


Lys 


Asn 


Lys 


Ser 


Pro 


Arg 


Ser 


Lys 
95 


Arg 


Asn 


Arg 


Lys 


Val 


Lys 


Gin 


Glu 
110 


Arg 


Glu 


Asp 


Asp 


Arg 


Gin 


His 


Arg 
125 


Glu 


Pro 


Ser 


Arg 


Asn 


Ser 


Asp 


Arg 
140 


Asp 


Arg 


His 


Arg 


Thr 


Ser 


Asn 


Glu 
155 


Arg 


Pro 


Gly 


Asp 


Arg 


Asp 


Thr 


Gin 
170 


Asn 


Leu 


Gin 


Phe 


Tyr 


Asn 


Ala 


Arg 
185 


Arg 


Arg 


Glu 


Gly 


Gly 


Gly 


Gly 


Ser 
200 


Glu 


Ser 


Gin 


Gly 


Asn 


Asn 


Lys 


Glu 
215 


Lys 


Glu 


Val 


Phe 


Glu 


Leu 


Ser 


Gly 
230 


Ala 


Leu 


Leu 


Gly 


Val 


Val 


He 


Lys 
245 


Tyr 


Ser 


Glu 


Lys 


Lys 


Arg 


Trp 


Arg 
260 


Leu 


Tyr 


Pro 


Pro 


Val 


Met 


Tyr 


He 


His 


Arg 


Gin 



190 195 



Gly 


Glu 


Leu 


Pro 


Val 


Ala 


Thr 




205 










210 


Gly 


Phe 


Pro 


Ala 


Gly 


Phe 


Ser 




220 










225 


Pro 


Thr 


Pro 


Val 


Leu 


Ser 


Pro 




235 










240 


Gin 


Cys 


Arg 


Leu 


Gin 


Ala 


Leu 




250 










255 


Gly 


Leu 


Glu 


His 


Leu 


Leu 


Ala 




265 










270 


Pro 


Thr 


Pro 


Pro 


Gly 


Ser 


Leu 




280 










285 


Asp 


His 


Lys 


Glu 


Pro 


Trp 


Val 




295 










300 


Gly 


Ser 


Gly 


Tyr 


Pro 


Leu 


Gly 




310 










315 


Pro 


Gly 


Met 


Phe 


Phe 


Phe 


Glu 




325 










330 


Glu 


Ara 


Glv 


Ser 


Ara 


Ara 


Ara 




10 










15 


Pro 


Ala 


Glv 


Val 


Val 


Val 


LVS 




25 










30 


Ala 


Pro 


Pro 


Ala 


His 


Ara 


Ara 




40 










45 


Ser 


Pro 


Pro 


Thr 


Ser 


Glu 


Pro 




55 










60 


Ar cr 


Ala 


Ara 


Glv 


Val 


Ser 


Ara 




70 










75 


Ala 


Ser 


Glv 


Arg 


Arg 


Ser 


Lvs 




85 










90 


Ser 


Pro 


His 


His 


Ser 


Thr 


Val 




100 










105 


His 


Pro 


Ara 


Ara 


Glv 


Arg 


Glu 




115 










120 


Glu 


Gin 


Glu 


His 


Arg 


Arg 


Ala 




130 










135 


Arg Gly 


His 


Ser 


His 


Gin 


Arg 




145 










150 


Ser 


Gly 


Gin 


Gly 


Gin 


Gly 


Arg 




160 










165 


Ala 


Gin 


Glu 


Glu 


Glu 


Arg 


Glu 




175 










180 


His 


Arg 


Gin 


Arg 


Asn 


Asp 


Val 




190 










195 


Glu 


Leu 


Val 


Pro 


Arg 


Pro 


Gly 




205 










210 


Pro 


Ala 


Lvs 


Glu 


Lys 


Pro 


Ser 




220 










225 


Glu 


Asp 


Thr 


Asn 


Thr 


Phe 


Arg 




235 










240 


Pro 


Pro 


Glu 


Ala 


Arg 


He 


Pro 




250 










255 


Phe 


Lys 


Asn 


Asp 


Glu 


Val 


Leu 




265 










270 


Ser 


Ala 


Tyr 


Leu 


Leu 


Gly 


Arg 
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275 










280 










2 85 


His 


Arg 


Arg 


He 


Ala 
2 90 


Asp 


He 


Pro 


He 


Asp 
295 


His 


Pro 


Ser 


Cys 


Ser 
300 


Lys 


Gin 


His 


Ala 


Val 
305 


Phe 


Gin 


Tyr 


Arg 


Leu 
310 


Val 


Glu 


Tyr 


Thr 


Arg 
315 


Ala 


Asp 


Gly 


Thr 


Val 
320 


Gly 


Arg 


Arg 


Val 


Lys 
325 


Pro 


Tyr 


He 


He 


Asp 
330 


Leu 


Gly 


Ser 


Gly 


Asn 
3 35 


Gly 


Thr 


Phe 


Leu 


Asn 
340 


Asn 


Lys 


Arg 


He 


Glu 
3 45 


Pro 


Gin 


Arg 


Tyr 


Tyr 
3 50 


Glu 


Leu 


Lys 


Glu 


Lys 
3 55 


Asp 


Val 


Leu 


Lys 


Phe 
3 60 


Gly 


Phe 


Ser 


Ser 


Arg 
365 


Glu 


Tyr 


Val 


Leu 


Leu 
370 


His 


Glu 


Ser 


Ser 


Asp 
375 


Thr 


Ser 


Glu 


He 


Asp 
380 


Arg 


Lys 


Asp 


Asp 


Glu 
385 


Asp 


Glu 


Glu 


Glu 


Glu 
390 


Glu 


Glu 


Val 


Ser 


Asp 
395 


Ser 





















<210> 29 
<211> 126 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 4349106CD1 

<400> 29 



Met 


Gly 


Leu 


Leu 


Thr 


Phe 


Arg 


Asp 


Val 


Ala 


He 


Glu 


Phe 


Ser 


Arg 


1 






5 










10 










15 


Glu 


Glu 


Trp 


Glu 


His 
20 


Leu 


Asp 


Ser 


Asp 


Gin 
25 


Lys 


Leu 


Leu 


Tyr 


Gly 
30 


Asp 


Val 


Met 


Leu 


Glu 
35 


Asn 


Tyr 


Gly 


Asn 


Leu 
40 


Val 


Ser 


Leu 


Gly 


Leu 
45 


Ala 


Val 


Ser 


Lys 


Pro 
50 


Asp 


Leu 


lie 


Thr 


Phe 
55 


Leu 


Glu 


Gin 


Arg 


Lys 
60 


Glu 


Pro 


Trp 


Asn 


Val 
65 


Lys 


Ser 


Ala 


Glu 


Thr 
70 


Val 


Ala 


He 


Gin 


Pro 
75 


Asp 


He 


Phe 


Ser 


His 
80 


Asp 


Thr 


Gin 


Gly 


Leu 
85 


Leu 


Arg 


Lys 


Lys 


Leu 
90 


He 


Glu 


Ala 


Ser 


Phe 
95 


Gin 


Lys 


Val 


He 


Leu 
100 


Asp 


Gly 


Tyr 


Gly 


Ser 
105 


Cys 


Gly 


Pro 


Gin 


Asn 
110 


Leu 


Asn 


Leu 


Arg 


Lys 
115 


Glu 


Trp 


Glu 


Ser 


Glu 
120 


Gly 


Lys 


He 


He 


Leu 
125 


Trp 





















<210> 30 
<211> 519 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 4834217CD1 

<400> 30 



Met 


Ala 


Ala 


Glu 


Ala 


Ala 


Asp 


Leu 


Gly 


Leu 


Gly Ala 


Ala 


val 


Pro 


1 








5 










10 










15 


Val 


Glu 


Leu 


Arg 


Arg 
20 


Glu 


Arg 


Arg 


Met 


Val 
25 


Cys 


Val 


Glu 


Tyr 


Pro 
30 


Gly 


Val 


Val 


Arg 


Asp 
35 


Val 


Ala 


Lys 


Met 


Leu 
40 


Pro 


Thr 


Leu 


Gly 


Gly 
45 


Glu 


Glu 


Gly 


Val 


Ser 
50 


Arg 


He 


Tyr 


Ala 


Asp 
55 


Pro 


Thr 


Lys 


Arg 


Leu 
60 


Glu 


Leu 


Tyr 


Phe 


Arg 
65 


Pro 


Lys 


Asp 


Pro 


Tyr 
70 


Cys 


His 


Pro 


Val 


Cys 
75 
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Ala 


Asn 


Arg 


Phe 


Ser 
80 


Thr 


Ser 


Arg 


Thr 


Arg 


Arg 


Gin 
95 


Lys 


Gly 


Glu 


Val 


Thr 


Phe 


Asp 
110 


Met 


Glu 


Tyr 


Lys 


Phe 


Gin 


Gly 
125 


Met 


Ser 


Thr 


Glu 


Ala 


Gly Gly 


Lys 


His 










140 






Met 


Leu 


Arg 


Pro 


Glu 
155 


Lys 


Glu 


Leu 


Tyr 


He 


Pro 


Pro 
170 


Pro 


He 


Asp 


Tyr 


Phe 


Tyr 


Arg 
185 


Pro 


Glu 


Asn 


Pro 


Pro 


He 


Ser 
200 


Gly 


Glu 


Arg 


Arg 


Pro 


His 


Asn 
215 


Ala 


He 


Val 


Pro 


Lys 


Gin 


Pro 
230 


Leu 


Glu 


Val 


Cys 


Thr 


Asn 


Pro 
245 


Val 


Asp 


Lys 


Leu 


Phe 


Asp 


He 
260 


Arg 


Pro 


Ala 


Asn 


lie 


Ser 


Val 
275 


His 


Pro 


Phe 


He 


Ala 


Tyr 


Tyr 
290 


Met 


He 


He 


Arg 


Phe 


Gly 


Tyr 
305 


Asp 


Pro 


Tyr 


Gin 


Val 


Leu 


Asp 
320 


Phe 


Arg 


Tyr 


Ala 


Pro 


Ser 


Asp 
335 


Leu 


Pro 


Asn 


Tyr 


Ser 


Leu 


Pro 
350 


He 


Thr 


Val 


Thr 


Met 


His 


Asp 
365 


Leu 


Lys 


Ser 


Gly 


Ala 


Arg 


Lys 
380 


Pro 


Ala 


Ser 


Val 


Tyr 


He 


Phe 
395 


Arg 


Glu 


Met 


Phe 


Tyr 


Gin 


Leu 
410 


Cys 


Asp 


He 


He 


His 


Arg Asn Asp Gly 










425 






Asp 


Gly 


Trp 


Cys 


Leu 
440 


Pro 


Lys 


Met 


Ser 


Leu 


Met 


He 
455 


Arg 


Gin 


Leu 


Phe 


Ser 


Ser 


Ser 
470 


Ala 


Lys 


Thr 


Tyr 


Glu 


Ser 


Gly 
485 


Glu 


Asp 


Glu 


Glu 


Glu 


Glu 


Asp 
500 


Phe 


Lys 


Met 


Glu 


Thr 


Glu 


He 
515 


Leu 


Asp 



<210> 31 

<211> 493 

<212> PRT 

<213> Homo sapiens 

<220> 



Ser 


Leu 


Leu 
85 


Leu 


Arg 


He 


Arg 


Lys 
90 


Val 


Leu 


Gly 
100 


Thr 


Glu 


Ala 


His 


Ser 
105 


He 


Leu 


Gly 
115 


He 


He 


Ser 


Thr 


He 
120 


Asp 


Phe 


Gin 
130 


Tyr 


Leu 


Ala 


Val 


His 
135 


Thr 


Ser 


Met 
145 


Tyr 


Asp 


Lys 


Val 


Leu 
150. 


Ala 


Phe 


Phe 


His' Gin 


Glu 


Leu 


Pro 






160 










165 


Phe 


Ser 


Arg 
175 


Leu 


Asp 


Ala 


Pro 


Val 
180 


Thr 


Gin 


His 
190 


Arg 


Glu 


Gly 


Tyr 


Asn 
195 


Asn 


Leu 


He 
205 


Gly 


Leu 


Ser 


Arg 


Ala 
210 


Phe 


Val 


Asn 
220 


Phe 


Glu 


Asp 


Glu 


Glu 
225 


Ala 


Ala 


Ala 
235 


Gin 


Thr 


Trp 


Arg 


Arg 
240 


Arg 


Lys 


Val 
250 


Glu 


Glu 


Glu 


Leu 


Arg 
255 


He 


Trp 


Ser 
265 


Arg 


Asn 


Ala 


Val 


Lys 
270 


Asp 


Lys 


Leu 
280 


Lys 


Val 


Leu 


Leu 


Pro 
285 


Thr 


Gly 


Pro 
295 


Trp 


Arg 


Ser 


Leu 


Trp 
300 


Arg 


Lys 


Asn 
310 


Pro 


Asp 


Ala 


Lys 


He 
315 


He 


Arg 


Cys 


Gly Met 


Lys 


His 


Gly 






325 










330 


Val 


Lys 


Ala 
340 


Lys 


Arg 


Ser 


Thr 


Tyr 
345 


Val 


Lys 


Lys 
355 


Thr 


Ser 


Ser 


Gin 


Leu 
360 


Gin 


Gly 


Leu 


Gly Pro 


Ser 


Gly 


Thr 






370 










375 


Ser 


Ser 


Lys 
385 


Tyr 


Lys 


Leu 


Lys 


Asp 
390 


Gly 


Ala 


Leu 
400 


Pro 


Pro 


Tyr 


Arg 


Gin 
405 


Leu 


Asn 


Val 
415 


Glu 


Glu 


Leu 


Gin 


Lys 
420 


Ala 


Glu 


Asn 
430 


Ser 


Cys 


Thr 


Glu 


Arg 
435 


Thr 


Ser 


Asp 
445 


Glu 


Leu 


Arg 


Asp 


Thr 
450 


Thr 


He 


Arg 
460 


Ser 


Lys 


Arg 


Pro 


Ala 
465 


Ala 


Asp 


Gly 
475 


Gly 


Lys 


Glu 


Gin 


Leu 
480 


Glu 


Glu 


Asp 
490 


Glu 


Glu 


Glu 


Glu 


Glu 
495 


Pro 


Ser 


Asp 
505 


Gly 


Ser 


Glu 


Asn 


Glu 
510 


Tyr Val 
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<221> misc_f eature 

<223> Incyte ID No: 5156094CD1 

<400> 31 



Met 


lie 


Leu 


Asn 


Ser 


Leu 


Ser 


Leu 


Cys 


Tyr 


His 


Asn 


Lys 


Leu 


He 


1 








5 










10 










15 


Leu 


Ala 


Pro 


Met 


Val 


Arg 


Val 


Gly Thr 


Leu 


Pro 


Met 


Arg 


Leu 


Leu 










20 










25 










30 


Ala 


Leu 


Asp 


Tyr 


Gly Ala 


Asp 


He 


Val 


Tyr 


Cys 


Glu 


Glu 


Leu 


He 










35 










40 










45 


Asp 


Leu 


Lys 


Met 


He 


Gin 


Cys 


Lys 


Arg 


Val 


Val 


Asn 


Glu 


Val 


Leu 










50 










55 










60 


Ser 


Thr 


Val 


Asp 


Phe 


Val 


Ala 


Pro 


Asp 


Asp 


Arg 


Val 


Val 


Phe 


Arg 










65 










70 










75 


Thr 


Cys 


Glu 


Arg 


Glu 


Gin 


Asn 


Arg 


Val 


Val 


Phe 


Gin 


Met 


Gly 


Thr 










80 










85 










90 


Ser 


Asp 


Ala 


Glu 


Arg 


Ala 


Leu 


Ala 


Val 


Ala 


Arg 


Leu 


Val 


Glu - 


Asn 








95 










100 










105 


Asp 


Val 


Ala 


Gly 


He 


Asp 


Val 


Asn 


Met 


Gly 


Cys 


Pro 


Lys 


Gin 


Tyr 










110 










115 










120 


Ser 


Thr 


Lys 


Gly 


Gly 


Met 


Gly 


Ala 


Ala 


Leu 


Leu 


Ser 


Asp 


Pro 


Asp 










125 










130 










135 


Lys 


lie 


Glu 


Lys 


He 


Leu 


Ser 


Thr 


Leu 


Val 


Lys 


Gly 


Thr 


Arg 


Arg 










140 










145 










150 


Pro 


Val 


Thr 


Cys 


Lys 


He 


Arg 


He 


Leu 


Pro 


Ser 


Leu 


Glu 


Asp 


Thr 










155 










160 










165 


Leu 


Ser 


Leu 


Val 


Lys 


Arg 


He 


Glu 


Arg 


Thr 


Gly 


He 


Ala 


Ala 


He 










170 










175 










180 


Ala 


Val 


His 


Gly 


Arg 


Lys 


Arg 


Glu 


Glu 


Arg 


Pro 


Gin 


His 


Pro 


Val 










185 










190 










195 


Ser 


Cys 


Glu 


Val 


He 


Lys 


Ala 


He 


Ala 


Asp 


Thr 


Leu 


Ser 


He 


Pro 








200 










205 










210 


Val 


■lie 


Ala 


Asn 


Gly 


Gly 


Ser 


His 


Asp 


His 


He 


Gin 


Gin 


Tyr 


Ser 










215 










220 










225 


Asp 


He 


Glu 


Asp 


Phe 


Arg 


Gin 


Ala 


Thr 


Ala 


Ala 


Ser 


Ser 


Val 


Met 








230 










235 










240 


Val 


Ala 


Arg 


Ala 


Ala 


Met 


Trp 


Asn 


Pro 


Ser 


He 


Phe 


Leu 


Lys 


Glu 








245 










250 










255 


Gly 


Leu 


Arg 


Pro 


Leu 


Glu 


Glu 


Val 


Met 


Gin 


Lys 


Tyr 


He 


Arg 


Tyr 








260 










265 










270 


Ala 


Val 


Gin 


Tyr 


Asp 


Asn 


His 


Tyr 


Thr 


Asn 


Thr 


Lys 


Tyr 


Cys 


Leu 










275 










280 










285 


Cys 


Gin 


Met 


Leu 


Arg 


Glu 


Gin 


Leu 


Glu 


Ser 


Pro 


Gin 


Gly 


Arg 


Leu 








290 










295 










300 


Leu 


His 


Ala 


Ala 


Gin 


Ser 


Ser 


Arg 


Glu 


He 


Cys 


Glu 


Ala 


Phe 


Gly 










305 










310 










315 


Leu 


Gly 


Ala 


Phe 


Tyr 


Glu 


Glu 


Thr 


Thr 


Gin 


Glu 


Leu 


Asp 


Ala 


Gin 








320 










325 










330 


Gin 


Ala 


Arg 


Leu 


Ser 


Ala 


Lys 


Thr 


Ser 


Glu 


Gin 


Thr 


Gly 


Glu 


Pro 








335 










340 










345 


Ala 


Glu 


Asp 


Thr 


Ser 


Gly Val 


He 


Lys 


Met 


Ala 


Val 


Lys 


Phe 


Asp 








350 










355 










360 


Arg 


Arg 


Ala 


Tyr 


Pro 


Ala 


Gin 


He 


Thr 


Pro 


Lys 


Met 


Cys 


Leu 


Leu 




365 










370 










375 


Glu 


Trp 


Cys 


Arg 


Arg 


Glu 


Lys 


Leu 


Ala 


Gin 


Pro 


Val 


Tyr 


Glu 


Thr 










380 










385 










390 


Val 


Gin 


Arg 


Pro 


Leu 


Asp 


Arg 


Leu 


Phe 


Ser 


Ser 


lie 


Val 


Thr 


Val 










395 










4 0.0 










405 


Ala 


Glu 


Gin 


Lys 


Tyr 


Gin 


Ser 


Thr 


Leu 


Trp 


Asp 


Lys 


Ser 


Lys 


Lys 










410 










415 










420 


Leu 


Ala 


Glu 


Gin 


Ala 


Ala 


Ala 


He 


Val 


Cys 


Leu 


Arg 


Ser 


Gin 


Gly 










425 










430 










435 


Leu 


Pro 


Glu 


Gly 


Arg 


Leu 


Gly 


Glu 


Glu 


Ser 


Pro 


Ser 


Leu 


His 


Lys 










440 










445 










450 


Arg 


Lys 


Arg 


Glu 


Ala 


Pro 


Asp 


Gin 


Asp 


Pro 


Gly 


Gly 


Pro 


Arg 


Ala 










455 










460 










465 


Gin 


Glu 


Leu 


Ala 


Gin 


Pro 


Gly Asp 


Leu 


Cys 


Lys 


Lys 


Pro 


Phe 


Val 
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470 

Ala Leu Gly Ser Gly 
485 

<210> 32 

<211> 516 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Incyte ID No: 

<400> 32 



Met 


His 


Gly Arg 


Lys 


1 








5 


Gin 


Leu 


Gly 


JUCU 


Asn 
20 


Phe 


Gin 


Ala 


^3 X Ll 


Gly 
35 


Ser 


Val 


Sen 
noil 




Ser 
50 


Ser 


Thr 


V d -L 


Lys 


Thr 
65 


Val 


Asp 


Ser 


Leu 


Phe 
80 


Glu 


His 


Tyr 


Lys 


Cys 
95 


Leu 


His 




Th-r 

J. I IX 


He 
110 


Phe 


Lys 


Cys 


Asp 


He 
125 


Leu 


Ala 


OCX 


n x o 


Gin 
140 


Cys 


Asn 




v~_y is 


Gly 
155 


Gin 


His 


Arg 


Arg 


He 
170 


Glu 


Cys 




T ,vc 
J— »_y 


Val 
185 


Gin 


Arg 




Hi Q 


Thr 
200 


Gly 


Lys 


Val 


Phe 


His 
215 


He 


His 


Thr 


Gly 


Glu 
230 


Val 


Phe 


Ser 


Arg 


Asn 
245 


Thr 


Gly 


Glu 


Lys 


Pro 
260 


His 


His 


He 


Ser 


His 
275 


Glu 


Lys 


Pro 


Tyr 


Lys 
290 


Lys 


Ser 


Ser 


Leu 


Val 
305 


Pro 


Tyr 


Lys 


Cys 


Asn 
320 


Ser 


Leu 


Val 


Asn 


His 
335 


Lys 


Cys 


Asn 


Glu 


Cys 
350 


Ala 


Gin 


His 


Leu 


He 
365 


Asp 


Glu 


Cys 


Asp 


Lys 
380 


His 


His 


Arg 


He 


His 



475 

Glu Glu Ser Pro Leu 
490 



5665139CD1 



Asp Asp Ala Gin Lys 

10 

Pro Gin Ser His Leu 
25 

Lys He Tyr Lys Tyr 

40 

Ser Leu Val Ser Pro 
55 

His He Ser His Thr 

70 

Thr Gin Lys Glu Lys 

85 

Asn Glu Arg Gly Lys 
100 

His Gin He He His 
115 

Cys Gly Lys He Phe 
130 

Arg He His Thr Gly 
145 

Lys Val Phe His Asn 
160 

His Thr Gly Glu Lys 
175 

Phe Asn Gin lie Ser 
190 

Gly Glu Lys Pro Tyr 
205 

Gin He Ser His Leu 
220 

Lys Pro Tyr Glu Cys 
235 

Ser Tyr Leu Val Gin 
250 

Tyr Arg Cys Asn Val 
265 

Leu Ala Gin His Gin 
280 

Cys Asn Glu Cys Gly 
295 

Asn His Trp Arg lie 
310 

Glu Cys Gly Lys Val 
325 

Trp Arg He His Thr 
340 

Gly Lys Val Phe Ser 
355 

He His Ala Gly Glu 
370 

Ala Phe Ser Gin Asn 
385 

Thr Gly Glu Lys Pro 
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480 

Glu Gly Trp 



Gin 


Pro 


Val 


Lys 


Asn 
15 


Pro 


Glu 


Leu 


Gin 


Leu 
30 


Asp 


His 


Met 


Glu 


Lys 
45 


Pro 


Gin 


Arg 


He 


Ser 
60 


Tyr 


Glu 


Cys 


Asn 


Phe 
75 


Ala 


Asn 


He 


Gly 


Thr 
90 


Ala 


Phe 


His 


Gin 


Gly 
105 


Thr 


Lys 


Glu 


Thr 


Gin 
12 0 


Asn 


Lys 


Lys 


Ser 


Asn 
13 5 


Glu 


Lys 


Pro 


Tyr 


Lys 
15 0 


Met 


Ser 


His 


Leu 


Ala 
16 5 


Pro 


Tyr 


Lys 


Cys 


Asn 
180 


His 


Leu 


Ala 


Gin 


His 
195 


Lys 


Cys 


Asn 


Glu 


Cys 
210 


Ala 


Gin 


His 


Arg 


Thr 
225 


Asn 


Lys 


Cys 


Gly 


Lys 
240 


His 


Leu 


He 


He 


His 
255 


Cys 


Gly 


Lys 


Val 


Phe 
270 


Arg 


He 


His 


Thr 


Gly 
285 


Lys 


Val 


Phe 


Ser 


His 
300 


His 


Thr 


Gly 


Glu 


Lys 
315 


Phe 


Ser 


His 


Lys 


Ser 
330 


Gly 


Glu 


Lys 


Pro 


Tyr 
345 


Arg 


Asn 


Ser 


Tyr 


Leu 
360 


Lys 


Pro 


Tyr 


Lys 


Cys 
375 


Ser 


His 


Leu 


Val 


Gin 
390 


Tyr 


Lys 


Cys 


Asp 


Glu 
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395 





Glv 


Lys 


Val 


Phe 
410 


Ser 


Gin 


Arg 


lie 


His 


Thr 


Glv 
425 


Glu 


Lvs 


Lys 


Val 


Phe 


Glv 


Leu 


Asn 


Ser 








440 






His 


Thr 


Glv 


Glu 


Lvs 
455 


Pro 


Phe 


Phe 


Ser 


Met 


Arg 


Ser 
470 


Ser 


Leu 


Gly 


Glu 


Lys 


His 


Phe 
485 


Lys 


Cys 


Asp 


Asn 


Ser 


Tyr 


Leu 
500 


Val 


Arg 


Lys 


Ser 


Asn 


Thr 


Cys 


Asn 





515 







400 










405 


Asn 


Ser 


Tyr 
415 


Leu 


Ala 


Tyr 


His 


Trp 
420 


Ala 


Tyr 


Lys 
430 


Cys 


Asn 


Glu 


Cys 


Gly 
435 


Ser 


Leu 


Ala 
445 


His 


His 


Arg 


Lys 


He 
450 


Lys 


Cys 


Asn 
460 


Glu 


Cys 


Gly 


Lys 


Ala 
465 


Thr 


Asn 


His 
475 


His 


Ala 


He 


His 


Thr 
480 


Asn 


Glu 


Cys 
490 


Gly 


Lys 


Leu 


Phe 


Arg 
495 


His 


Gin 


Arg 
505 


Phe 


His 


Ala 


Gly 


Lys 
510 



<210> 33 
<211> 3163 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc^feature 

<223> Incyte ID No: 091502CB1 

<400> 33 

gtcggaggtc ttacccaaca gattgacgcg 
tgattgactg ggctggcgtt aactgtgcgg 
tcagagcttc gctgacagcc aaggatggca 
gagaactcca gtgcaggggg gcccagtggg 
caggacagca ctttcgagtg caacatctgc 
ctgtgtggcc acctcttctg ttggccgtgt 
agacaggtgt gtcctgtttg caaagctggc 
ggaaggggca v gcactgggca acaggacccc 
cagaggccag agccggagaa tagaggggga 
ttccagatgt cttttggaat tggggcattt 
ataaatgatg ggcggcctcc tccagctgtc 
ttcctgtcac gcctcttcct atttgtggcc 
taatgctggg ctcctgccta catccgtggc 
actcctggtg tttggcttcc tggctaatct 
catcaaggag tcttgtttct tcatcagagc 
cctgaatatc gccaccgctg taaacactct 
ctcatgggtg acaccatgaa -.atcttgtttc 
gggaagaggc agaaagagag aaactgtcag 
gaaacaaagg gatgcaccaa atggtatttc 
ggtaagttgc ttctgaagcc agtgggggct 
ccagtgccgc tctgttctct ttccttcccc 
atgcaagaaa ttgctgtccc ataaaaatca 
cgtgaattca ccagagactc aaagatcttt 
ggcctcagag aacaacttga atgacttcct 
acatgtggct taactcacag gtttcccatc 
ctctctctgc cagagaacat acagccgaga 
cattaggaaa gacctggagt caggactttg 
ctgaacaagc agccctgtcc cctaggctgc 
tgccacagga aactgggggc tttgtcaggt 
ctcagaagcc aagttgtctt ttatgaagag 
tgattttggt atggcttgat agagttccct 
ggaagcatgt gactgccaag caggcaaccc 
cttggggagc cccagcacaa tgatattgtg 
attattcttc ccaatacctt gatttgattt 
tattgaggga ctatggtacc aagcaggtag 
ctgccttcta ggcttcatcc cagaaatcca 
gagatgggct ttcctctggg cctctctcct 
cccagcaata accaacaaat ggtaggaagc 
gcatagttta tggaaacaaa gatcttgagg 



gcgttagtat tggccgtgta cccgaaaaac 60 
aggcaggatt tccctgggga aagtcttcac 120 
agcaaggggc cctcggcctc tgcatctcct 180 
agcagcaatg gcgctggcga gagcggaggg 240 
ttggacacag ccaaggatgc cgtcatcagc 300 
ttacatcagt ggttggagac cagacctaac 3 60 
atcagccgag acaaggtcat ccccctctat 420 
agagagaaga cccct.cc teg tcctcaagga 4 80 
tttcaaggat ttggatttgg agatggtggc 540 
ccctttggga tatttgecac agcatttaat 600 
cctgggacac cccagtatgt ggacgagcag 660 
ctggtgatca tgttctggct cctgattgcc 720 
agggctctgg actggtgacg tgccacccca 780 
tgactcctgg aatcagtggg atcagtaaca 840 
tttggaactc gagaccagtt ggegatgace 900 
ataacttcag gecttggcat tgagtcatct 960 
agecagttet gcaggtcctg actctgeaga 1020 
agtataattt cacctgagtt taatattaca 1080 
tggaaatttt catgtcttta aatacccctt 1140 
cctcagatag agaggttccc ctttcaaatc 1200 
tcccactccc cctcttcttc ctctgtagag 1260 
taattgcagt agctaaagct ggggtcactt 1320 
tattggctct gggctgtgct cagtgtcttt 1380 
ggtttcctgg cataaattat tcctggtgag 1440 
agctttctcc ctaaaactat gttcatctgc 1500 
atactgeega agctgagact gactactgtg 1560 
gtgggatttg gagctccgag gcagtaataa 1620 
agaagcttga atgcatcctc tcccagaacc 1680 
cagcccaact geatgeaaaa gaccaccatc 1740 
gcaaggaaag gggaaaccca catgtgaccc 1800 
gaaaactcct tgtatgtgtg ctaaaaccag 1860 
ctgatgattt gtaaagccag gtggcagggc 1920 
tggtcttccc tcctgtggaa tcgaggggaa 1980 
tcagtttcat aagcttcttc ctctgaatct 2040 
gactgttcac ctggtggaac agttcttgct 2100 
gcctctttct ggagacccca aagctggagg 2160 
actttgecat ccacactgct cctggctaac 2220 
cccatctatt gctttttctc aattatgact 2280 
aagatgaggg aagccctccc ctcttcacag 2340 
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tccccatctc 
tataaagtat 
ctttatagtg 
ttatcacctt 
agaattggtg 
agtcagatgg 
aaatgaatca 
aggcactctg 
tcgtaggtcc 
ctcctggctc 
tcatccttct 
accctcctcc 
gatatgtaca 
atttagaatt 



ttctcctttg 
tgagtaggtg 
tgcctgatag 
ctggaaatcc 
ggacccccct 
actcaacaga 
tttggactct 
ggaacctgtt 
ccactcaccc 
ggccccaaca 
gccaaggcag 
tgccaccctt 
gtggcacacc 
aaacaaatgg 



tacctgtcaa 
gttcatatgc 
attttaaaca 
gcaaagttgc 
cagtgcagtg 
aatgggtttc 
tcaatgaaat 
agtaaagcca 
tctgccagga 

gggsgccccc 

ctttcctttc 
gtggactagg 
ttaaccagtc 
ttttacatta 



<210> 34 
<211> 2185 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<223> Incyte ID No: 



763816CB1 



<400> 34 
ggatctccag 
cagcggaaaa 
ggttctcctg 
gagagccgaa 
tggaagctgt 
ccaaaacagc 
ggaagcatgc 
ctgttgagct 
tggttcagga 
cctcaaagaa 
agtacataaa 
aagagcaggt 
gtccagagca 
aagaggagat 
gctgtcgtgc 
atagccctta 
atcatgtagc 
atttcccaag 
caaataaaaa 
-acagtgtgaa 
tatcagatag 
cagttgtgtt 
atcttgtaag 
aatatgtatg 
gtactcaaag 
atattagttg 
ttacatgtat 
tcttaatcag 
ttgctgattt 
tcccacagga 
ttcagtggag 
caccccccac 
ttagagtttg 
gttgataagt 
tgttaaagaa 
tgttcatgga 
gaaaggt ttt 



agctcagaaa 
cttctcctgc 
atctctggaa 
aaccatgtcc 
tctctctatc 
ccctcctgct 
ccttttccct 
tggtgatgaa 
agaacttcta 
gctcaagaaa 
aacagatttg 
tgatgtggaa 
gtctagaaat 
tgcaaaatac 
tatgaaaaaa 
caaatgcaca 
agcccatggg 
aggttttaag 
gctaatggaa 
tatttgttct 
catagttgga 
actaagaatg 
tcaataaatt 
aataataata 
cattcagtta 
aaatgtttaa 
ttttaaatca 
cttctagatg 
tctgctaatg 
gacaagtgaa 
tcttccattc 
ccaccgcctg 
tatatcctac 
aggaactgaa 
atatatcgat 
ttattttatt 
catgtcaaaa 



accctcaggg 
ttcacttgat 
gtcttccttt 
ttatgggcca 
gatac tgagc 
tctccagaag 
gaactcccca 
ctacaaatag 
gcttcaccta 
gacaaccaag 
gatgcgatgg 
tccattgatt 
gtgctacagt 
atgaagcg.tg 
ggtgctgttt 
atctgtggaa 
caaagtttac 
aaacatttaa 
gctct tgaac 
acaaaggtgt 
tcagtagatg 
agcatttgat 
tctgtatagt 
caaggaagta 
aagtgtatct 
gaattaggca 
taagagttat 
tttattagct 
ggaaaaattg 
agataagtgt 
tgtatcttac 
ccagctcaca 
tcctgggcct 
agtcttccag 
tttatgtttt 
ctaaaatatt 
aaaaa 



caccagagtt 
cgaatccact 
ttcctgggca 
aggggcctct 
gccccaacta 
cagaagaata 
ggagtgagcc 
ggctggccaa 
gctaagtaag 
ttcccaccat 
ttttgtgtgt 
accaggtcct 
actaattttc 
ctatgaaaaa 



cagtgtttaa 
tggtaggaag 
cccactcaag 
ggagtgtctc 
gtggagggag 
atgaaaagtt 
caggagagct 
atgccat t tg 
gcaggagagc 
ccctcggcaa 
tttctgtgtt 
gaccccagtc 
actgttgtga 
aaa 



acagacaaaa 
aaaccaccag 
agtgctc ttt 
ttctctagag 
agaggac tta 
gtgggtagga 
cagccaacag 
attttgaacc 
tgacttggga 
gc tcaccacc 
cttagcctcc 
agaaaatgat 
aagtgatttg 



tcaccagatc 
ttccctgagt 
tt tattgagc 
tctgaggccc 
ctagaaaacc 
cacgcaaacg 
aatctgctct 
atgccataga 
agaaactctt 
agagctcaga 
atattaaggg 
ttagcaaaga 
ttactgaaga 
gaaaaggaaa 
tgcatcattt 
aggcttttct 
ttaaatgtcc 
ctcattgtca 
cgccactgga 
ttgttggaac 
acatgtatgg 
catttttttc 
ccagatggat 
ggcattccat 
gtgtgtggaa 
tgaaaaataa 
tttctatctg 
tttatgtcat 
actaagtctt 
gattttagat 
cctaagatct 
ctaatagatg 
tacatgtcgc 
attcacagta 
tcaattatgt 
ttgtcaaatg 



tttggaagct 
cccagaaaag 
ctcagaaacc 
caaaagcagc 
tgccctgt tt 
tgcccttt tt 
attctcagaa 
tga tcaaaaa 
agaagatact 
cgctgagc tt 
ccaggaatca 
gaacaaaatg 
aaaagaagct 
gtattattgc 
ggttaataag 
tttggaatct 
acgttgtaat 
aagccggcat 
ggagcagcaa 
cattctttgt 
tgtaccgtgt 
tggtctctgt 
taaacttctc 
ttaataatca 
ctaatttcag 
atttgagaaa 
atgtaaaatt 
gaaatgttgg 
taaaatagtt 
ctttcttgtc 
ggttcttccc 
attcttaatt 
ctgttggggc 
gaaaatttta 
tactgtaaat 
tgtatcaacc 



ttctcctgat 
ttcccgtggt 
tgtcttccct 
ctcagatatc 
cccgagcctg 
ccagagcccc 
tcacagaagg 
tgtgatattt 
ttatttcctt 
agtagtagtg 
agcagtgatc 
gacatgacta 
tttatctctg 
aaaatttgtt 
cataatgttc 
ctccttaaaa 
tttgaatcaa 
aatgaagagg 
atttgataac 
aagtatagct 
ttcac tgtct 
ctatgtgact 
atttctttta 
agagcaagtt 
acaatagaaa 
ttttgtttcc 
agtttataaa 
agtctcaggg 
tgcagcc ttc 
catagttgtt 
tccccatccc 
gccaaatgtg 
ttaagaccag 
tagacatttc 
accttgtacc 
aaattaaaaa 



2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3163 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2185 



<210> 35 

<211> 2408 

<212> DNA 

<213> Homo sapiens 
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<220> 

<221> unsure 
<222> 2243 

<22 3> a or g or c or t, unknown, or other 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 9611B4CB1 



<400> 35 

ggagcatccg 

c tgggaccag 

ccctgcgatc 

ctaccatgcc 

acccccaggc 

tgagctggcc 

ctgggcttcc 

ctccagaaac 

ctctcccctt 

cactccaact 

ccaccacccc 

cagctcacct 

aattctataa 

gggagaagct 

tgccggggcg 

tcctacttct 

tgatgagcaa 

agagctcccc 

tggacgaaca 

ttcaggcccg 

gccaacacct 

acctccagag 

gccacatgtc 

tgcccctggg 

tgcacgcatg 

agtcactgcc 

gcccagggct 

accacatgtg 

gaccgcgtgt 

ccccatagag 

ccactttcag 

ctggctttca 

gggtgaaggg 

ttgaatgcta 

aggaaatata 

tttacatagt 

aaatgttttc 

agagaaactg 

actgcctatt 

taagtgaaat 

aggatttt 



gacgctccag 
gagcctccca 
cactggcagc 
cttcctcagc 
cagcttggtg 
gccatcgggg 
cgaagtggcc 
cagcaccaac 
gcctattctg 
ttctgtagaa 
agaaacatcc 
gaccccctcc 
ctttgtgatc 
ggaggccctt 
cggggagctg 
cacc tccaac 
cctcacgcga 
ggcccagctc 
ctcccagatc 
aaaggccatg 
ggacggtgag 
ctacttgtcc 
atttgggact 
agccccgcca 
gcaggctggc 
cttcccgcag 
gcaacccctc 
gaaccagaga 
ccttgcctga 
caccccggtc 
gacattgtcc 
gggtggtcag 
tcttggtggt 
aaccatatca 
tctgtgtaaa 
tttgagtatt 
atgaggaagg 
atggggaagg 
ttaaaaccgt 
tggttagaat 



tccaatctgg 
cgccccattg 
cctgcctccc 
ctgcaccgca 
cccgagcctg 
gagattgcca 
ccagatgcaa 
tacccagtgg 
gagccggtca 
gataccacct 
cctcctcctc 
tccctgttcc 
ctccacgcca 
ggcgtgcccg 
agctgcctgc 
ttcgactgtc 
caggggtcgc 
agctccgaca 
ttcgccagga 
tggaggaagg 
cggatgcagg 
taccaggcac 
ggggcgccct 
ccctttccca 
acccccccac 
tccccagcct 
attatccacc 
gggtcccagg 
ccacctgggg 
tgccctgtgc 
gggagccctt 
aactggatac 

gggggtattg 

aacacttccc 
taacaatctg 
attttaagat 
agaatgttgt 
ggnatcgact 
aagccaaatt 
ctgggtttaa 



gctgcctccc 
acggtgtttc 
tggccagcaa 
gcccacatgg 
tccccggtgg 
gcccaccaga 
cctccactgg 
agtgcaccga 
aaaacccctg 
ctccaaatac 
ctcctcctcc 
cttcctccct 
gggcagacga 
acggggccac 
aggacgccat 
gcctgagcct 
cagactgtgt 
cggccagcct 
aggtggccaa 
aacaggacac 
cggcggcact 
agatggagca 
atggggctcg 
cttggccggg 
cgccctcccc 
tccctacggc 
acgcacagat 
cgcccgagga 
aacacccctg 
cctgtggaca 
catttaggac 
ggtgtttaca 
ctacggtctt 
aaaatttaca 
cc tatacatt 
tgcatctttt 
gtgtttacag 
ttataggcag 
tgtattgtag 
aaacttttta 



accatcctcg 
ggactggagc 
cttggaaatc 
gcccagcaag 
ctgccaggag 
gctgccaagc 
cctccc tgat 

ggggtctgca 

ctctgtcaaa 
caagccgtgc 
tccttcatct 
ggaatcatca 
acacatcgcc 
c ttctgcgag 
agaccactca 
gcaccaggtg 
catcccc ttc 
gctctccggg 
caccttcaag 
ccgagccctg 
gaacgcagcc 
gctccaggtg 
aatgcccttt 
gtgcccgcag 
acagccagca 
ctcacccgca 
ggtacagctg 
caagacgcag 
gacccaggca 
gtggaagatg 
aaaacgggcg 
attccaatct 
ttaattataa 
agcaagagag 
gtcttataaa 
ccagtgttcc 
agccaatgta 
taaagtattt 
ttaatccaaa 
attttataaa 



gctttgccct 
caagggtgct 
agccagtccc 
ctctgtgacg 
cctgaggaga 
agcccacctc 
acccccgcag 
ggcccccagt 
gaccagacgc 
ccacctactc 
actccttgtt 
tcggaacaga 
ctgcgggttc 
gatttccagg 
gctttcatca 
aaccaagcca 
ctgcccctgg 
ctggtgcggc 
ccccacaggc 
cgggaacaga 
tactcagcct 
gcttttggga 
gggggccagg 
ccgccacccc 
gcctttccac 
ccccctcaga 
gggctgaaca 
gaggcagaat 
tcggccagga 
aggtcatctg 
cgatgatgcc 
ctctatttct 
taaatattta 
agttaaaatt 
ttaatttctc 
ttttgtcaaa 
tattaaattg 
acaaaccatg 
ttggtcctgt 
ataaccaggg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2408 



<210> 36 

<211> 2564 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc^f eature 

<223> Incyte ID No: 1255525CB1 



<400> 36 

ccaacctcat tttttctctg ttgccacagg 
atcctgttat ttccctatcg attttacccc 
agaagacctc atcctgagtc ttcgcgctgt 
tctgctgccc ctgttgattg agaaagtgga 
tctacagact ctgaatgctt gctgtgctgt 



accctttgtg gaggagttgt ttgaagtgac 60 

tccacctaat gatccccatg gtatccagag 120 

gctggcttct acaccacgat ttgctgagtt 180 

ttctgaggtt ctgagtgcca agttggattc 240 

gtatggacag aaggaactga aggacttcct 300 
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ccccagcctt 
gctcctgggt 
tctgaatggc 
cacccagctt 
cctatcttat 
ggaggattcc 
ctaccctgtg 
ggagtcaaat 
gcaagccttg 
gctgcagcat 
tgctgtctgt 
ctggtatttc 
gccagagaag 
catggtgtct 
tgtgacacac 
cttcccgagc 
actgcttatg 
actcatgcgg 
tgctgccaag 
attcctacag 
tcaggccttc 
cagctcctgc 
agcagctgat 
ccatgccgaa 
ggtccagggc 
tcatgtactt 
cttgctgctg 
ccttcagcct 
caccaagttt 
gtgcatgcat 
gattcgggcc 
gtcagccaga 
cctagactgt 
cagggaaacc 
tgttgcatgg 
tgtgtgtgag 
tgtcagaaaa 
ccagggctgg 



tgggcttcta 
ttcttgaagc 
ttcaaggacc 
cagcttgttg 
gaggacttgg 
cagagttgca 
gccttcagca 
ttgactaacg 
tcagctgtat 
ctctggcaag 
cagagcctca 
caccagacag 
gagccctcag 
gtcattggca 
attgtgcccc 
agattccagc 
gcctttgtct 
gagcttttgg 
tgctttgcag 
c tagctgtgg 
actct tcttc 
c ttacagccc 
ggcttctctc 
gtgcggatca 
ttccatgctg 
aacaggctgc 
gaggccctgt 
cttctactgg 
ctgaacctca 
gctctcactc 
ttagccaaac 
ggggagtggt 
tctgacaatc 
actggtctct 
ctgtacaaag 
acacttgcgt 
ggggcgatat 
cagatgttgt 



tccgcagaga 
tgcagcagaa 
agctgtgctc 
gcatccgtac 
agctggcagt 
gggtggcagc 
gccacctcgt 
gagatgagcc 
caacacatcc 
tgaacagagg 
gacagatggc 
ctataccttg 
ttctgagaaa 
ctgctacaac 
tcttcttgga 
cat tccagga 
gctccctgcc 
aactgagctg 
gactcctcaa 
acaaagtgga 
tctgggtaac 
ggctcatggg 
tgc tcatgtc 
tgt tccgcca 
ctccccaaga 
ccaagcctgt 
cctgccctga 
aagcacccca 
gctctagccc 
gcctgcccac 
ccctggatga 
ttctgttggg 
taacc tggga 
gactgccttt 
aacataagag 
ttgaagaaag 
gctgctgagg 
ctacctgctg 



gagcagccag 
atggagctat 
ac tggtattc 
actgacagtc 
gggtcacctg 
actggaagca 
acccaagctc 
cacccaatgc 
cagcatcgtc 
gaatatggtt 
agaaaaatgt 
cctgcttgcc 
agtactattg 
ccacctgagc 
tggcaacgtg 
tggctcc tea 
tcgaaatgtg 
ctgccacagc 
caagcaccct 
ggctggcctg 
aaaggeccta 
cc tcctgagt 
tgac tgcac t 
gcggttcttc 
tgtgaagcca 
actcttgcca 
ctgtgtggtg 
agtcatgagt 
ttccatggct 
ccctgtgctg 
caagaagaga 
gagccctggc 
ttactaactg 
cccacagaca 
tccatatttc 
atctagggtc 
ggtgagtgca 
aagaataaag 



eggeggacaa 
gaagacaaag 
atggctctaa 
ttgggtgccc 
tacagactga 
tcaggaaccc 
gctgaggagc 
tcccggcatc 
aaggagacac 
gcacaatcca 
cagcaggacc 
ttggctgtgc 
gaggatgagg 
cctgagttag 
tcctttctgc 
gggcagaggc 
gaaatccctc 
tgcccctttt 
gcagggcagc 
ggctctgggc 
gtgetcagat 
gacccagaat 
gatgtgctga 
acagataatg 
aactacttga 
gagctgccca 
cagctctcca 
cttcacgtgg 
gtceggateg 
ctgccgtaca 
ctggtgcgca 
agctgagccc 
ttgagecate 
cagcacaaat 
tagtggattt 
ctgggtctct 
tatgagtgtg 
atcg 



tccttgaaat 
ateaaaggee 
cagaccccag 
agecaga tct 
gcttcctgaa 
tggctgetet 
tgcgtgtagg 
tgtgctgtct 
tgcctctgct 
gtgacgt tat 
ctgagagttg 
aggectctat 
tgttggctgc 
ctgcccagag 
ctgaaaacag 
ggctgat tgc 
agctgaacca 
cttccaccgc 
agctggatga 
ectgtegtag 
accatcctct 
taggtccagc 
ctcgtgc tgg 
tgcctgcttt 
agggtct ttc 
cgcttctttc 
ccctcagctg 
acaccctcgt 
ccgcactgca 
aaccacaggt 
aggaagcagt 
tcagtcc tgg 
ttccccaaag 
gctaggcctc 
gtaaaataag 
tgcatttata 
gecctgagga 



360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2 2 80 

2340 

2400 

2460 

2520 

2564 



<210> 37 
<211> 957 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 1297447CB1 



<400> 37 

ccttctgggc atgtctgcca tatggctcca 

ggagggctcc cggcatctcc tggegtcegg 

gcgcactgca gcacaactag agatgtaegg 

ggtacagccg cgagaaagag tcaagaacag 

ttgctgcgtt gacaacagac tacaccctca 

aagatgatca catcccaggg atcagtgtcg 

gaggagtggc agcatctgga ccctgctcag 

aactacagcc accttgtctc agtagggtat 

ttggagaaag gegaggagee atggatatta 

ggtgagttag tatgtgccag atggaattta 

gataataaga ccattgaaat gttctttagg 

aagtaacatg ttgacgtgac ecaaatatge 

tatcactttc atacacacat cttgttgttt 

tagecagaga ttaataaaat tttatatata 

tgaatatata aagtgatttt tcaacacccc 

aaagagcatc cctccttttg taaaattaaa 

<210> 38 



ggtttgtttt tctccccggc actctgaegg 60 
gtagaggacg eggaggatge tgagctgctg 120 
atgcccccat cttgatctta cagaatcaga 180 
acagagtege ttgaggactc aggagggtgt 240 
cagtttgetc tgctcttcca acaccagtgg 300 
tttagggatg tgactgtggg cttcactcaa 3 60 
aggaccctgt acagggatgt gatgetggag 420 
tgcattccta aaccagaagt gattctcaag 480 
gaggaaaaat ttccaagcca gagtcatctg 540 
aaggaaggta gatcacaaag ggtaagtttg 600 
aatcatgttt tagaggctcc agacctttgg 660 
agtgactctg aatcacccag catctcccag 720 
tttttaaatt gtgatataaa ctatatagtt 780 
tatgtaacac tcatgtaccc atcattgaga 840 
cagaaagctc cctgttgtac tcttcccagt 900 
tatatttttc tataaaaaaa aaaaaaa 957 
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<211> 2701 
<212> DNA 

<213> Homo sapiens 
<220> 

<22l> misc_feature 

<223> Incyte ID No: 1441094CB1 



<400> 38 

gcggcagtga gtatgtgtgt gacggacccc 
cctccttctt gcccgccggg ctgtggaact 
tcccgtccgc ccctcactcg tggcagggcg 
atgatgtttg gtggctatga gactatagaa 
tcatctagtg aactgagtgt tgatagtgag 
tatgcccaag atcttgatga tgtcatcagg 
aattcggaat cttcgagtag taaaccaaat 
gaggccatcc agctgtcaga tgggtcagag 
tatagatgta aaggaaagaa tgttagagtt 
tcttctcttc aatctaatga gctggttgat 
aaatctgaag agagatcagg tgtaatccga 
gaagaggaag agagcaccat ttcagaaggt 
tgtgaagtag atgataaaga tgatgatatc 
gttactgaag gagaagatgg tataaactgg 
atagctaata accgaacacc tggaagatgg 
attatctgta gaaattgtga caaacgtggt 
aaagttcgtc gctgcttcct gtgctccagg 
cccctttgtg aatactgtcc tgtgcctaag 
tcctgggata aacagtgtga ccgatgtcat 
gaaatctgga ggcagtatca cctaacgacc 
ccttcaagac catcagcctt agcatattgc 
cacgaatgtc cagaaagaga agtgtatgac 
t'atgatgaca aatatgaaat tcaggagaga 
ctcaagaaaa atggggttat cccagagcca 
gagaaccccc accatgatat aaggaagggc 
caagaaaata aagaaacaca aaaagaaatg 
aggaaggctg acagacatcg tgaagtggat 
tcttctcctg gcagttttaa aacccagaag 
taccacacgt caagagaaga caagtctccc 
aaggagaggt gctgggaaga tgatgacaat 
aaaaagtctt aagccgtcag gcagcctctg 
gtgtctataa ccttctggca ctgtgtttat 
ttttgttttt ttaatttcag ccattcctag 
ctctgtgtcc aacaggatat taggtaagaa 
tccaatatgt catctttact cagaatccta 
aaagtgtttt aaaaatactt tgggaaaaaa 
aaatccttgc aggaagagaa attaacacta 
tttttttttt tttttacttt ggaaaggaca 
tctgctgaag ctgttgtcag aatcttcctt 
acaaaagtgt catcattgaa gccctgaaga 
gaaaagaact atccagtata aatgtcagaa 
tcaaaaattt taacataatc acaatgaaat 
aagcggactg tcactcagat ctgcagagat 
ttgcattttt ttcaactacc gcaaagctat 
tacttctcat aacctaatgt cagactaaga 
a 



cgccgcccgc ggctcgggac ccctgcctac 60 
agcgcgtgcc ccctcgccgg cctcgacgtc 120 
cagctcctgc ttcaaggtta ctgacttttt 180 
gcatacgaag atgatcttta tcgagatgag 240 
gtggaatttc aactctatag ccaaattcat 300 
gaggaagagc atgaagaaaa gaactctggg 3 60 
cagaagaagc taatcgtcct ttcagatagt 420 
gtcatcactt tgtctgatga agacagtatt 480 
caagcacaag aaaatgccca tggtctttct 540 
aagaaatgca agagtgatat tgagaagcct 600 
gaggtcatga ttatagaggt cagttcaagt 660 
gataatgtgg aaagctggat gctactggga 720 
cttctcaacc ttgtgggatg tgaaaactct 780 
tccatcagtg acaaagacat tgaggcccag 840 
acccagcggt actattcagc caacaaaaac 900 
catttatcaa aaaactgccc cttaccacga 960 
agaggacatc tcctgtattc ctgtccagcc 1020 
atgttggacc actcatgtct tttcagacat 1080 
atgctaggcc actatacaga tgcttgcaca 1140 
aaacctggac cacccaaaaa gccgaagacc 1200 
tatcactgcg cgcaaaaagg ccattatgga 1260 
ccgtctccag tatctccatt catctgctac 1320 
gaaaagagac taaaacaaaa aataaaagta 13 80 
tccaagctac cttatataaa agcagcaaat 1440 
cgtgcctcat ggaaaagcaa caggtggcct 1500 
aagaacaaga atagaaactg ggagaagcac 1560 
gaggattttc ccaggggccc caaaacctac 1620 
ccttctaagc cctttcaccg ttcatcacat 1680 
aaggaaggca agaggggcaa gcagaagaaa 1740 
gataacttat ttcttattaa gcagagaaaa 1800 
atgtggcttt tcattgttca tttggccttt 1860 
tatctatgat taaataaagt gagtttttgg 1920 
agttactgaa tatccatgga gatctcaatt 1980 
agtacaaaga taaacctgga cttctcccat 2040 
gggataggta gaagaattca tcttttcaag 2100 
actgcatcaa aggtaattta tcctcaaatt 2160 
agaaaaagtc atcaatattt ttcaactttt 2220 
ataactataa acctatatcc agattttctt 2280 
tggacaaaac atcactagct gactataaaa 2340 
ggcagggaat tgagcttcag caaaatacag 2400 
gacagatttc ctaaacaagt aaaagagaca 2460 
cattttttac cacttttaca gcggtgtttc 2520 
gaatattact caaaaaattt ttttgttctc 2580 
acagattttt ttgtacttgt ggatcttttg 2640 
aaaataaaat atctgagagt aaaaaaaaaa 2700 

2701 



<210> 39 
<211> 2517 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 1479382CB1 

<400> 39 

ggagaaaacg acttcagtgt catgtactcc acaaggaaaa actgtgctca actctggctg 60 
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ggtcctgctg 
cgagatacag 
tattatggag 
agacggggca 
aatagcaaat 
gacagcagca 
caggaaaaaa 
aagagcagaa 
tcatctaagc 
aagcctttac 
gcttcaagaa 
aaaaatttgc 
agcgggtgct 
cattcgcagg 
acaaatctga 
agcagtgtga 
caggaggaag 
gacacaggca 
atagaggaat 
atgggtcccc 
acagactgtg 
aaagacagct 
cagttaatcc 
agcagcagca 
aagttaagca 
tttaactcag 
gaaaataggg 
ctttctctct 
acagatgatt 
gaagacgatt 
tctatagata 
taagctcttg 
caattattcc 
atactattgt 
ttttatcgta 
gttgtaaaat 
tgaaaacagt 
atatatgact 
tatgctaata 
cttggagttt 
ttaatatcgc 



cgtttataaa 
catgtgtgaa 
atgggttctt 
ctggtgcttt 
atggactcag 
aaaattcaga 
acaa tgcaac 
cgttaacgag 
taactcatat 
tttcaaagat 
aactcgaaat 
ccattaaaaa 
tgactagaca 
gggagagctc 
aggaggcctc 
cggaaccttg 
aactggctca 
tgtccaaaaa 
ctactccagt 
attc tgacca 
ctccttcacc 
ttagaactgc 
tagaaaataa 
aaacaaatga 
aagaccatga 
gatcaggcag 
ggtcttatac 
tggagtctcg 
cctcctcttc 
ttattcctct 
ttgacatttc 
gtcttaactt 
tcaactgaaa 
agaaagtgta 
ctttttaaat 
aaaggctcaa 
aatgtggcta 
aaacaataat 
tcaccagcaa 
tgtcagtttc 
aaaccattgg 



ccatgattgc 
ggctctaaga 
tggagaaaat 
taaatccaga 
agaaacagat 
cagtcaatct 
ttc taaccga 
gcaatctatg 
aaataattcc 
aaaattgaga 
gggaaactta 
agataaggag 
cgcggcgaga 
gccctgcacc 
tgacatcaag 
ccccgacagt 
tgagactgca 
gaagtcacga 
gcacgattct 
gggtgagcac 
cgtcggttgt 
aaaaagtaaa 
ctctgggatt 
ccaagagaat 
caacgataac 
tagttctaca 
agaggggctt 
aatggaggtg 
tgagggcgat 
tcctccagct 
ttcaaggaga 
gacctgggat 
gtttagtggc 
cagcatactg 
agccttctta 
gcaaaattgt 
cacagttttt 
t taaaacaaa 
tgatctttgg 
acactgtttg 
ttgggagtca 



agacctaatt 
gacattgaac 
aatgagttct 
gtgggactgc 
aaacgtttaa 
gtcagctcta 
aaatcttcag 
tcaagaattc 
agggtaccaa 
aatcattgca 
gtactgaaag 
ccagagggac 
gaacacagac 
tacataactc 
cttgaaccaa 
ggtgaacagc 
caaaaagggg 
caaggaaaac 
cctggaaaag 
agtggcactg 
tcagttgtga 
aagaagaggc 
cccaaattga 
gatggaatga 
aatctctatg 
aaattaaaaa 
catgaaaatg 
gatgactata 
gaagaggagg 
aagcgcttga 
agagaagatc 
aactacttta 
agcacttcta 
actcaattct 
cgtgcaattc 
acagtgatag 
ttaactgtaa 
tcatagtagc 
ctttttgatt 
ctggcccagg 
gattggtttc 



gtaagtttgt 
ctggagaaga 
gcgagtgtta 
ctgcgcctgc 
ataggcttaa 
acactgatgc 
ttggcgtaaa 
cagcttcttc 
agaaactgaa 
agcggctgga 
agcctaaagt 
cagcccaagc 
agaatcctgt 
ggcggtcagt 
atacgttgaa 
tgcagccagc 
aggcaaagtg 
ttgtgaaaca 
acgacgcggt 
tgggcgtgcc 
catcagatag 
gaatcacaag 
ctcttcgtag 
actcttccaa 
tagcaaagct 
tccagctaaa 
gggtgtgctg 
gtcagtatga 
atgactatga 
ggttaatagt 
agtctttaag 
aagaaataaa 
ttgtcccttc 
taagtctgat 
tgagttagag 
caactttcca 
gagcatcagc 
agcatattaa 
tatttgctag 
tgtactgttt 
ttaaaaaaaa 



gtcaactggt 
aatttcttgt 
cacttgcgaa 
tcctgttatc 
aaagt taggt 
agataccact 
aaagaat age 
caactctacc 
gaagectgea 
gcaaaagaat 
agttctgtat 
cgcagttgcc 
gagaggtget 
gaggacaaga 
tggctataaa 
tcctgtgctg 
tcataagagt 
gtttgcaaaa 
accagatttg 
tgtgagc tac 
cttcaaaaca 
gtatgatgea 
gegtcatgat 
aataagcatc 
taataatgga 
acgagatgag 
cagtgatcct 
ggaagaaagt 
tgatgacttt 
tggaaaagac 
gettaatgee 
aaattccagt 
ac ttatcagc 
ttgtgcaaat 
gtaaagcect 
cacaggacgt 
tggctcttta 
gggtttctag 
atgtttcccc 
gtggcctttg 
aaaaaaa 



120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2517 



<210> 40 
<211> 1698 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc.feature 

<223> Incyte ID No: 1503131CB1 



<400> 40 

gaetacgett 

gggaagggtg 

gccccaccgc 

ggcgctggag 

cctgtggctc 

getgeggaaa 

acagaacagc 

ggcttttaag 

gttatctatt 

agctctgtat 

aaatgaaaac 

tagagcatta 

agcttcttta 

atctggagaa 

tggcagattg 



gctggtttgc 
gggtgagggg 
gtgggacggc 
aagaagctgt 
attcaccacc 
gccaaaccaa 
aagaggaagg 
catgtttcaa 
tgggaagaaa 
ggtgataaga 
tgttcctctc 
caagatctgg 
cctgttgaag 
aggctttcca 
gcggcagaaa 



ggceggtett 
gcgaggccgc 
gtgaacgtgg 
eggagttgag 
gtaaacactc 
acaggaagct 
ggccagagtt 
gtgaaactga 
ggtctgttta 
agectaggaa 
tgggatctcc 
aaaatgeage 
tccaagaagt 
aaatggtaga 
tagatgatag 



ggatgaagcg 
agetagggeg 
tgtcggaggg 
caactcgcag 
gcggcccatc 
tacttttctc 
tacaaaagat 
tgaaagttgt 
tgaaaatgat 
gcgaacttat 
aagtgaacca 
ctcaggtgat 
atctctatta 
ggatgcgtgt 
aaagcaactc 



gcggccgtgg 
gegaaactet 
atgtcagcct 
cagagcgtgc 
gtcaccgtgt 
tacctagcca 
tt tgcaccag 
aagaagcacc 
gtattagaac 
gaacagataa 
ccacagactc 
gcagcagttc 
gataaaataa 
atgttgctgg 
actcgaatgt 



tgagagcgtg 
cctcccc teg 
tctctgaggc 
agaccttgtc 
gggagcggga 
atgatgtcat 
ttatagtgga 
ttggaagagt 
aacttaaaca 
aggtgga tga 
tagatctcgt 
atcagaggat 
cagataaaga 
cagattacaa 
tagcagattt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 



35/50 



WO 00/78954 



PCT/US00/16766 



tcttcgttgt caaaaggaag cccttgcaga gaaagagcat aaattggaag agtacaagcg 960 

caagctagcc agagtttccc tggtgcgcaa agaactcagg tcccggatcc agagcctgcc 1020 

agacttatct cgattgccca atgtcactgg cagccacatg cacctgccct ttgcgggaga 1080 

catctacagt gaagattgat ggaccagcct ctttccaggt cccaggactt tgcaagagat 1140 

ggagacaggt taggtggata gtcctgtagt gttatttttg tatattgttg agaaagaaac 1200 

actaacaaaa ggaacaacga gtaatttata aaattgttta aaatgttggt ttgtttacta 1260 

ttttattggt aaattaacat gacttaattt gaacaaagtg atgatttgtg tgtattaata 1320 

agctcacaaa tagtgattat tttcaaaaga tctttttgta aatttttttt tttgatccaa 1380 

ggccttgtga gaaatctctt gtttgtattt cttcaggtat tttcaattgt ttttctttat 1440 

tttcttcagc atttaatcac tgtatactat gtaattcctg aaggggaaga actaattagg 1500 

agttgattga gactttatta tggtatagtt aggacttgat tgtagaaggg actaaatgtt 1560 

ccaacataat ctttaccctt ctccccaaac aaaatatcaa ttctgtgtct ctacatgcct 1620 

ttgattcctt agaggtcggt aatttaacat aaaacaggga tgtcactttg aggggcagat 1680 

gaaagtctag agacctgt 1698 



<210> 41 

<211> 2250 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte ID No: 1594803CB1 

<400> 41 

gctcgcagcg catccccgcg cacaggttcg 
catgttcaac gggggaatgg ccacaacatc 
cgctgccttc ctcgcactgc tcaagtttct 
gacggtgatg accacgctat acaccgccaa 
ttgcgtggag ttcctgaaga agaacctgcg 
ggcgcgactc ttcgatgaac cgcagctggc 
cactgcagac gccatcaccg cggagggctt 
tgtcctggag cgcgacacac tgggcatccg 
ctggtccgag gccgagtgtc agcggcagca 
, ggttctgggc aaggccctgg gcctcattcg 
tgcaggtccc gcacagtcgg gcatcctggt 
cttcaccgtc aaccccaagc cacgagtgga 
tgggaaggag tgcagcatca accgcttcca 
gaccagtgac cgcatcaggt tctcagtcaa 
gtatggatcc atccacgggc ccaccgacta 
tagcaacacc gtcttgggcc agaacgacac 
cttccgcgtc atgttcaagg agccggtgga 
tgccacgctc aagggcccag actcccacta 
cgagtcgccc accacgggcg ccaagacctg 
caatggcaca tccgtggagg acggccagat 
ccgacaccga caccgccctc cctccgtggg 
gctggggccc ccccaccacg cggtgccagg 
catgccacct ttctcagcat caggacgggg 
tggatcaggg cagccgggga ggtggccagg 
tcaggactag ggacagggct gtgccggcct 
gcgcctgccc acgtcgtctg ccggcggtgc 
cacattgcaa tgcatttgcg attcccattt 
ctcccagagc cgtgggtccc agaccttgcg 
acgggcccca cctgtcacgt gcccgaggcc 
tgcctggatg ccggcttgag ttctgcgcac 
ggataggcct agccctggcc caggtggtga 
gatgggccca gataaaggcc cccgctgtcc 
cagcccaaga tgggactcag accctgtgcc 
cagccggccc cgaccgggtc caggagcact 
acctggtgcc gtgggagcca cccccaggtg 
acgcccccac ccaccccgcg tgtttctgcc 
aagccatggg aggggtgtcc tcctcagacc 
aacaaatgca cctgcaaaaa aaaaaaaaaa 



tgctggccgt gggcagcgcg tctttgatgc 60 

cacggagatt gagctgcccg acgtggaacc 120 

ctactcggac gaggtgcaga ttggcccgga 180 

gaagtacgcg gtgccagcgc tcgaggccca 240 

agccgacaac gccttcatgc tgctcacgca 300 

cagcctgtgc ctggagaaca tcgacaaaaa 3 60 

caccgacatt gacctggaca cgctggtggc 420 

tgaggtgcgg ctgttcaatg ccgttgtccg 480 

gctgcaggtg acgccagaga acaggcggaa 540 

cttcccgctc atgaccatcg aggagttcgc 600 

ggaccgcgag gtggtcagcc tcttcctgca 660 

gttcattgac cggccccgct gctgcctgcg 720 

gcaggtggag agtcgctggg gctacagcgg 780 

caagcgcatc ttcgtggtgg gatttgggct 840 

ccaagtgaac atccagatta ttcacaccga 900 

gggcttcagc tgcgacggct cagccagcac 960 

ggtgctgccc aacgtcaact acacggcctg 1020 

cggcaccaaa ggcctgcgca aggtgacaca 1080 

cttcaccttt tgctacgcgg ccgggaacaa 1140 

ccccgaggtc atcttctaca cctaggctgc 1200 

gatagccgca gccccaggcc atcatctgct 1260 

cccagtgtcc cccaggccgt ctgtccactc 1320 

ttgccctgtg ttcaccacga gtgtggctgc 1380 

ccagtggcca ggccctgtgg agacaatccc 1440 

gggccagggc ccacggaccc gcagctcagg 1500 

gccgcgggcg tccctcgcgt ctcttcactg 1560 

ctctgctagg agccagcctg ggtggcgctg 1620 

ttccttttgt tcctgtccgt ttatcaggac 1680 

acccaagccc agcctgcggg gcgttcccac 1740 

gcaggattca gtgtggggac ggcccctgcc 1800 

gcggtttgca gtgtccgttc tcatccacct 1860 

agcctccctg gacggccctc gcggtccctg 1920 

ccagagctcc cctgccgcag aatggggccc 1980 

gctcgcctgt acatactgtt gccctagccc 2040 

ctgggggcac agcccctccc cactccggcc 2100 

ctgtgactcc tggaacctgc gtcctcccca 2160 

atgcccccag atgatttttt taaataaaga 2220 

2250 



<210> 42 
<211> 1082 
<212> DNA 
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<213> Homo sapiens 



<220> 

<221> misc_f eature " 

<223> Incyte ID No: 1736129CB1 



<400> 42 

ggcaaatccg 

gggccctgag 

gc tgacaggc 

caggaacctg 

gcgggggctc 

tgggataaag 

agcaggattc 

gggccccgaa 

acatgaagcg 

tctgcttcac 

gccacggtcc 

tcccttgtcc 

agatgcatga 

ttgctcagga 

tgcagcttaa 

tatggggtgc 

agggctctta 

acagggcctt 

aa 



ggatctcggc 
actgagtggt 
gcacgaatag 
taggcagcgc 
tgggcggaac 
gaagggtccc 
acccaaggag 
gtgcacccgc 
ggagcaccca 
ctgcgcccgc 
agccgccaag 
tgactgtggc 
ggtccgtgcc 
agcagggctg 
gcctctccac 
aggaactctg 
ataaagagga 
cgtgagacag 



tccgagaggc 
ccctatccca 
gcagccctct 
ttgagggtag 
aaaaatcaca 
agcaccatgg 
agaagtcccc 
tgcctcatca 
gcggacttcg 
tec ttcccct 
cccaccctgc 
aagacctttg 
cctcctggca 
catcaacact 
ggtgacgggt 
ggggccctga 
cccagaagat 
tgaaatcaga 



tcatctggca 
cttctctctg 
gctgtaagga 
egggatagea 
ggatgtcaga 
aggacccgaa 
agageccagg 
ccttcgcaga 
tggcccagaa 
cctccaaagc 
eggttgeaac 
ggcaggctgt 
ccttcgcctg 
acatteggea 
ggctctgtgg 
aggatttget 
tcttatttag 
taataatgag 



aaagggcgcg 
gggeggeget 
ggaaaactga 
getgeaageg 
ggatgtttcc 
ccctgaagag 
aggcaacatc 
ttccaagttc 
getgeagggg 
cctaatcacc 
cactactgcc 
ttctctgagg 
cacagagtgc 
tgcccggggg 
ctggtaggac 
tccctcccct 
agcttcagtc 
atcttttgtt 



gaaaccaegg 


60 


crtacrccaaccr 


120 


era c c t ga era q 


180 


cgcgtgc/gag 


240 


y y y a q v—^j a g w 


3 00 


o C4, v^- w y a y w 


3 60 


tgccacc tgg 


420 


caggagegtc 


480 


gtcctct tea 


540 


caccagcgca 


600 


cagcccacct 


660 


cggcaccgcc 


720 


ggtcaggact 


780 


gagctctgag 


840 


tcacccatga 


900 


gggaaggcag 


960 


tttggagcac 


1020 


aaaaaaaaaa 


1080 




1082 



<210> 43 
<211> 570 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 1874312CB1 



<400> 43 

gttccgctgt 

ggtggcaagg 

accggcggcg 

cgccgttatc 

cgcgagattg 

ctgeaggagg 

atccaacgcc 

geggaagaga 

ttcagggeca 

agggattcct 



gctgtttttc 
cgccacgcaa 
tgaaaaagee 
agaagtccac 
egcaggaett 
cgtgcgaggc 
aagegegtea 
gggcggtgat 
ccaccttttc 
ttggggggca 



cgtcatggct 
acagttggcc 
ccaccgctac 
tgaactgett 
taaaacagac 
ctacttggta 
ctatcatgcc 
tactgtgggt 
caaataaagt 
tttttctttt 



cgcactaagc 
actaaggcag 
cggccgggca 
attegtaaac 
ctgcgtttcc 
gggctatttg 
caaggacatc 
ctctcttgcc 
tgctgtaaga 



aaactgeteg 
cccgcaaaag 
ccgtggctct 
tacctttcca 
agagctccgc 
aggacactaa 
ccactcaccc 
ggtccaagca 
aaacccattt 



gaagtctact 
cgctccggcc 
gegegagate 
gegee tggtg 
tgtgatggc t 
cctgtgcggc 
gccagcatcc 
aagggtc ttt 
atggcccaaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
570 



<210> 44 
<211> 1007 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 1969301CB1 



<400> 44 

cccacgcgtc cgcccacgcg tccggtttgg 

etctteggga aagegaaace caaggctccg 

gtggacagta gagcagaatc cattgacaag 

aagtataagg atcagatcaa gaagatgaga 

aaagccttgc gagttttaaa gcaaaagagg 

caacagtcat tcaacatgga acaagecaat 

accacggttg atgctatgaa actgggagta 

aagatcgacc agattgagga tttacaagac 

gaaatccaag aagcactgag tegcagttat 



gtttcttege ggctgctcaa gatgaaccga, 60 
ccgcccagcc tgactgactg cattggcacg 120 
aagatttctc gattggatgc tgagctagtg 180 
gagggtcctg caaagaatat ggtcaagcag 240 
atgtatgagc ageageggga caatcttgcc 300 
tataccatcc agtctttgaa ggacaccaag 360 
aaggaaatga agaaggcata caagcaagtg 420 
cagctagagg atatgatgga agatgeaaat 480 
ggcaccccag aactggatga agatgattta 540 
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gaagcagagt 
gatgaggcag 
gatggagttc 
attcaagcat 
aaatttgcca 
gctcttttat 
atttgattcc 
tgacttgttt 



tggatgcact 
catctgcacc 
tggtggatga 
atcttgtaaa 
taacagattt 
tttttccatt 
tttttttctt 
tcattcatta 



aggtgatgag 
tgcaattcca 
atttggattg 
acaaacacat' 
aggtttcttt 
aagagactca 
atgaaaaacg 
ataataattt 



cttctggctg 
gaaggtgttc 
ccacagatcc 
attatgggac 
cctttctttg 
ttgcttggga 
aactcagttc 
gagataacca 



atgaagacag 
ccactgatac 
ctgcttcata 
taggaaatat 
aaggaaagtt 
aatgctttct 
aaaagtattt 
agggaat 



ttcttatttg 600 
aaaaaacaag 660 
gatttgcatc 720 
ttatctttcc 780 
taattacatt 840 
tcgtactaaa 900 
ttagctcgta 960 
1007 



<210> 45 
<211> 1622 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 1986873CB1 



<400> 45 

ttaaccaaat gatgcaaaac agcacctttt 

taattactgt ccttgtccac gcttcatgta 

ttctttttct tcagtaaaat ggacatgact 

tttactgaag aaaaagaagc ttttatctct 

ggaaaaggaa- agtattattg caaaatttgt 

ttgcatcatt tggttaataa gcataatgtt 

aaggcttttc ttttggaatc tctccttaaa 

cttaaatgtc cacgttgtaa ttttgaatca 

actcattgtc aaagccggca taatgaagag 

ccgccactgg aggagcagca aatttgataa 

tttgttggaa ccattctttg taagtatagc 

gacatgtatg gtgtaccgtg tttcactgtc 

tcattttttft. ,c„tggtct.ctg tctatgtgac 

tccagatgga "ttaaacttct catttctttt 

aggcattcca tttaataatc aagagcaagt 

tgtgtgtgga actaatttca gacaatagaa 

atgaaaaata aatttgagaa attttgtttc 

ttttctatct gatgtaaaat tagtttataa 

ttttatgtca tgaaatgttg gagtctcagg 

gactaagtct ttaaaatagt ttgcagcctt 

tgattttaga tctttcttgt ccatagttgt 

ccctaagatc tggttcttcc ctccccatcc 

actaatagat gattcttaat tgccaaatgt 

ttacatgtcg cctgttgggg cttaagacca 

gattcacagt agaaaatttt atagacattt 

ttcaattatg ttactgtaaa taccttgtac 

tttgtcaaat gtgtatcaac caaattaaaa 
aa 



ttcatagcac gacagcaaca aattttgcaa 60 
ttttgcaatc tcctcttcgg agataaaagc 120 
agtccagagc agtctagaaa tgtgctacag 180 
gaagaggaga ttgcaaaata catgaagcgt 240 
tgctgtcgtg ctatgaaaaa aggtgctgtt 300 
catagccctt acaaatgcac aatctgtgga 360 
aatcatgtag cagcccatgg gcaaagttta 420 
aatttcccaa gaggttttaa gaaacattta 480 
gcaaataaaa agctaatgga agctcttgaa 540 
cacagtgtga atatttgttc tacaaaggtg 600 
ttatcagata gcatagttgg atcagtagat 660 
tcagttgtgt tactaagaat gagcatttga 720 
tatcttgtaa gtcaataaat ttctgtatag 780 
aaatatgtat gaataataat acaaggaagt 840 
tgtactcaaa gcattcagtt aaagtgtatc 900 
aatattagtt gaaatgttta agaattaggc 960 
cttacatgta tttttaaatc ataagagtta 1020 
atcttaatca gcttctagat gtttattagc 1080 
gttgctgatt ttctgctaat gggaaaaatt 1140- 
ctcccacagg agacaagtga aagataagtg 1200' 
tttcagtgga gtcttccatt ctgtatctta 1260 
ccacccccca cccaccgcct gccagctcac 1320 
gttagagttt gtatatccta ctcctgggcc 1380- 
ggttgataag taggaactga aagtcttcca 1440 
ctgttaaaga aatatatcga ttttatgttt 1500 
ctgttcatgg attattttat tctaaaatat 1560 
agaaaggttt tcatgtcaaa aaaaaaaaaa 1620 

1622 



<210> 46 

<211> 2047 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 2010820CB1 



<400> 46 

tgttgaaaga ctactccaca tcattataga tgataataag agaacacaaa ttgttacaga 60 

aattatctca gagattcggg cgcccattgt tactgttggt gttaataacg atccagctga 120 

tgtaagatag aaagaactca agatggctga aataaaagtt aagcttatcg aagccaaaga 180 

agctttggaa aattgcatta ccttacagga ttttaatcgg gcatcagaat taaaagaaga 240 

aataaaagca ttagaagatg ccagaataaa ccttttgaaa gagacagagc aacttgaaat 300 

taaagaagtc cacatagaga agaatgatgc tgaaacattg cagaaatgtc ttattttatg 360 

ctatgaactg ttgaagcaga tgtccatttc aacaggctta agtgcaacca tgaatggaat 420 

catcgaatct ttgattcttc ctggaataat aagtattcat cctgttgtaa gaaacctggc 480 

tgttttatgc ttgggatgct gtggactaca gaatcaggat tttgcaagga aacacttcgt 540 
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attactattg caggttttgc aaattgatga 

aatctttgac caactgatga cgttcgggat 

acttcattgt gaaggtacag aaataaacag 

agagactgct acagctaaga atgttctgaa 

atctgaactt aggactggag ctgcagaagg 

ggtcagcagc aggattcttt ctcgtcttat 

ggatgttcaa cttcgacatt gcctaggcgt 

gactaatcag gaatgctttg aagaagcttt 

ccctgcatct tctcctttag ctgaaattga 

tttgacaaga ccaagtggat taaatcctca 

aacagtacat gacaatttgg ctatgaaaat 

gccagaaatt cgagtctata caaaagcctt 

aaaagatctt ctggttctat tgaatgagat 

gagagctttg gagaaaatca agattcagtt 

agctgaagca gcacaggatg ccaccttgac 

gaataaagaa gtatatatga ctccactcag 

tactcagcta aagactaaca gaggacagag 

gaggtgtcag actgctgaag ccgactctga 

agaaatgaag atgagactac caagacgagc 

taaccttgcc caatttctca atgaagatct 

tttaagatta tgtccagtta tttgctttaa 

aacaaacctg atgtctttct gaagattttc 

tattaaagca gtagagcagc atcagttatt 

cacacagatt tatcccaatc tgaaggtggg 

aagtgattcc tatggactct ttaagtttaa 
actagct 

<210> 47 
<211> 1817 
<212> DNA 

<213> Homo sapiens 



tgtcacaata aaaataagtg ctttaaaggc 600 

tgaaccattt aaaactaaaa aaatcaaaac 660 

tgatgatgag caagaatcaa aagaagttga 720 

actcctttct gatttcttag atagtgaggt 780 

actagccaag ctgatgttct ctgggctttt 840 

tttgttatgg tacaatcctg tgactgaaga 900 

gttcttcccc gtgtttgctt atgcaagcag 960 

tcttccaacc ctgcaaacac tggccaatgc 1020 

tatcacaaat gttgctgagt tacttgtaga 1080 

ggccaagact tcccaagatt atcaggcctt 1140 

ttgcaatgag atcttaacaa gtccgtgctc 1200 

gagttcttta gaactcagta gccatcttgc 1260 

tctggagcaa gtaaaagata ggacatgtct 1320 

agaaaaagga aataaagaat ttggtgacca 1380 

tacaactact ttccaaaatg aagatgaaaa 1440 

gggtgtaaaa gcaacccaag catcaaagtc 1500 

aaaagtgaca gtttcagcta ggacgaacag 1560 

aagtgatcat gaagttccag aaccagaatc 1620 

caaaaccgca gcactagaaa aaagtaaact 1680 

aagttaggaa' agacgatgga ggtggaatcc 1740 

taaagaagaa gttacccttg tcaaaatcag 1800 

tgctgtgcgc ttccacgtta ctttggcctg 1860 

atagtccaga aaaagtgtgc atcagtcagt 1920 

ctaggaatct catttttaaa tagtctctcc 1980 

atcatgtcct tatggaaaac ttacagtgtt 2040 

2047 



<220> 

<221> misc_f eature 

<223> Incyte ID No: 2013818CB1 



<400> 47 

caacaggaag 

aaaactagga 

ggactatctc 

aagaaagaag 

atcactacag 

gaagaaacta 

attaaaagat 

acggtaacaa 

gaaaatacag 

gttagaaggg 

gagtcatctc 

ggtagaccaa 

agaagcccca 

gctaaaagtg 

aacgaggaac 

caaagccaaa 

aaagaaacaa 

cgcagcagca 

aatgagcctt 

gaggttcctg 

cctcctgctt 

ctgaaagatg 

acgtcaaaaa 

cccatagaaa 

ccaagaaaaa 

tcctatccaa 

tacacctatt 

aataatttat 

taactccata 

catacattta 



catcagcaga 
aggcaaaaga 
agaaccagca 
ttaatcagga 
gtagggaatc 
ctaaaaaaga 
ctgtagaaaa 
gtcctagcag 
gaaataagca 
tcagagggag 
agttgactgt 
gaaaaatcaa 
agaagaaaga 
aaaatgttga 
tttcgatggt 
aacgttcatt 
atzgagcagga 
ggactcggtc 
tattttctcc 
cacagctgaa 
taaggagcag 
atgcacaatc 
caaaacaagc 
ttcggctgat 
ctacagaagt 
agcaaatttt 
tgtaaagtca 
ataaattatt 
tggagtgatt 
tatatgtaaa 



tgt tgctact 
aatttctgaa 
aatacctcaa 
catactagaa 
aaaaagatta 
agttaaggtt 
tcaggaaagt 
aatgatcaga 
agatgataaa 
agaagttagt 
tcaagcagaa 
tccatctgaa 
agctcccagc 
tgttggaaaa 
gatgagctct 
gcactcagta 
agaaagattg 
tagcaaggcc 
agcgtcagaa 
agaat tagtt 
acaaaaaaac 
agtagaaact 
aagcaaaaac 
ttcccccttg 
gacaggaaca 
acgcagaaaa 
tcagaatagt 
gtaaattttt 
gtttcagtcc 
tatggcttat 



cctaagatgc 

gcttctgaaa 

aattctgtta 

aacaccagtt 

aaatcatctc 

tcatctgtta 

gttgaaatta 

aaat tgagaa 

tccagtgaca 

ccatcagatg 

tttgatatgt 

gatgtaggat 

attagaagga 

ccagctttag 

aagaaaaaac 

tcagaagaac 

ctcgccacag 

atcttgttgc 

gttccaagga 

tcggatttat 

acatccaata 

ctgggaaagc 

acagaaaaag 

gctagcccag 

ggtcttggaa 

atgctgtaat 

gtggattatt 

atgtaaacag 

aggcaatttt 

taatggaatg 



cagggcagtc 

acatctattc 

cgcctaggag 

ctgtggaaca 

agctgttgga 

caaaaaggac 

taaatgatct 

gtactaattt 

agcagctgcg 

tgagagaaga 

c tgccatacc 

c taaggctgt 

gatctacaag 

gaaaatccat 

ttacaaaaaa 

gcacagatga 

cttccttcac 

cggacctttc 

aagcaaaagc 

cttctcagtt 

agaacaagct 

caaaagcgaa 

aaagtgcttg 

ctgacggagt 

ggaacagaaa 

ttcttgggaa 

aaatatctag 

aaggtc ttca 

tctattttat 

ttaaataaaa 



agtcaggaag 

tgatgtcaga 

aggaaggaga 

agaattacag 

accagcagtt 

tcctagaaga 

aaaagttagt 

agatgct tct 

tattaaacat 

ctccaacctt 

tagaaaacgt 

taaggaagag 

aaatacccca 

tttagtgcca 

gactgaaagt 

aatgacacat 

taaatca tec 

tgaaccaaac 

taaaaaaata 

tgtcatc tea 

tgaagatgaa 

acgaatcagg 

gtcacctcct 

caagagcaaa 

gaaactgtct 

gat tt taatg 

tt tggaagaa 

ataagtaaag 

attaagactt 

tgtatac ttc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
15 00 
1560 
1620 
1680 
1740 
1800 
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acagtcaaaa aaaaaaa 

<210> 48 
<211> 700 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte ID No: 2302032CB1 



<400> 48 

cccgcctcgc 

ccttgcacca 

agtctgcggc 

caggtggcac 

agcagcagtt 

caggcacatc 

ccacccagac 

cagctggagc 

gaacttgtta 

ttaacaagct 

cattaactgt 

tttctgaaac 



agagttggga 
tgtagatcag 
catgaacgcg 
aacggtgctg 
taacaacctg 
cgcagcagcc 
tcagaccacc 
atgggggagt 
aaaatataga 
ccaccaggaa 
tagcagttat 
aaaaatctat 



gaaggcaggg 
cgtcccccac 
agcagcgagg 
gtggagctga 
gatgcctttg 
cccagcacgg 
accagaacca 
cggccagaat 
ttcccagagc 
gccagctgcc 
aaatgttgtc 
atactatagg 



tggggggtgt 
tttggcatcc 
gcgagagctt 
ctcccgacat 
tagctcacaa 
tccagtttgt 
tcacctcgga 
tggcctggct 
ccacccccta 
cagccaggtt 
acgttacaaa 
aaaaaaaaaa 



ggaaaaataa 
cggccggccg 
cgcgggctcg 
ccatatctgc 
gcaaagtggc 
atcggaggaa 
gacccagaca 
gtgtctcaaa 
gggttgctga 
tgggagctgc 
aaaaacagac 



aaggaaaagt 
gggacc t ccc 
gtgcaaa ttc 
ggcatctgca 
tgccagc tga 
acagtgcctg 
atcacaggta 
cacgtccatg 
ttcattattt 
taggttaaaa 
aaaaaagaga 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
700 



<210> 49 

<211> 2704, 

<212> DNA " 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte ID No: 2326109CB1 



<400> 49 

cggggcggag gcggaggcag aggtggaggc 
acggggaatc aagagacccg gtttctaatc 
ttcggaaaaa agtaatcctc ttggaaagct 
gatacatgtt agaagacatg aaacaaggag 
atctcgagtt gattggaggc gaactaaaag 
tgaagagctt gatagtgaag aatttgatag 
tgaaaatgat- gaagagcttg atagtaacaa 
tgaaagagag ctcaacttaa gtaaaattca 
taactctggc aacggttcaa catatgaaga 
tgacttacaa gatcaggaaa aacatttaag 
tggacaaata atagaggatg atcaggaaaa 
caaacaaact ggacaaataa tagaggatga 
aagaaaaagg ctatcctctg tgatgtgtga 
agttagaaaa gtaggtgtta aacgtccccg 
ggaaatggag caaaagactc ctgaaaaaac 
gaagctcaaa gaactctcaa aacaaagatc 
tgaggactct gaaaaggaat cttgcccaag 
ggataattat gaatctgatg aagatggaga 
agatgaggag ggtgatgaag agaataaaaa 
actgaaatta gtaaaacgga attctcttta 
aagagttgtg aaggctcttc tgatcaacgc 
tgatggcaca aggcaaaaat catatgcaaa 
taaccgcttt gttcagcctc gtctagagag 
atataaggag cgagtagaaa attattctaa 
ctgttcctgc caggcttgtg gactgcatcg 
agagttgtat aacaccagga ccatgcaaat 
gttcactgtt ggcagaattt gtgccagccg 
taaattcaaa ctataccagg aatgttgcac 
acaagttaaa gaaacagtgg aaagaatttt 
ggagaaatat ggtcaacttg aagaatatct 
gtttgagttg taacacattt ccatcagaga 
tcatgattct tgtttttctg tatcatgtga 



gctttgaaag ccgtcattgt gtggcactgg 60 
cctgctctgt cactaacaag ctgcgtgatc 120 
ggaaacatag tccgaagctt aagtgaaaat 180 
aaattctaag agtcacgtgc ctgagcagaa 2 40 
aagtagtatc tcacaattac ttgatacjtga 300 
tgatgaagag cttgatagtg atgaaacjttt 360 
gggacctgat tgtaataaaa caccagcjaag 420 
aagtgaagga aatgacagta agtgtctcat 480 
agaaacgaac aaaatcaaac ataggaatat 540 
tcaagaggat.r-aatgatctca acaaacaaac 600 
acatttaagt caagaggata atgatctcaa 660 
tttagaagaa gaagacatca agcgaggaaa 720 
cagtgatgag agtgatgaca gcgatatcct 780 
tagagtggtt gaagatgaag gttcttcagt 840 
attagctgca caaaagcgag aaaaacttca 900 
tcgtcagaga cgcagtagtg gtagagattt 960 
cagtgatgaa gttgatgagg aggaagaaga 1020 
tgattatatt atcgatgact ttgtagtgca 1080 
ccaacaagga gaaaaattga ctacatcaca 1140 
ttcttttagt gaccactata ctcattttga 1200 
tttagatgaa tcttttctgg gaacattata 1260 
agatatgcta acatctcttc attatttgga 1320 
cttggtatct agaagtcgtt ggaaagagca 1380 
tgtaagtatt catttgaaga atcctgaaaa 1440 
ctactgtaaa tattcagtgc atttatcagg 1500 
agataatttc atgtcacatg ataaacaggt 1560 
taccagaatt tatcataaac tgaaacattt 1620 
cattgcaatg acagaagaag ttgaagatga 1680 
caggcggtca aaagaaaatg gctggattaa 1740 
caattttgcg gattactttc aagaagagaa 1800 
agatttttta aattcctgta aatgtgaaga 1860 
catgtttgta cattttatct atatcttcat 1920 
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ggcaaaatat 
ctaattaaac 
atattttaaa 
tgatatcctg 
ctacatgtat 
attttcacta 
catatatcat 
gtagaggaat 
tagagtcttt 
gttcttactt 
atttttatcc 
gacaaactca 
ctaaattttc 
aagg 



tttgtttaaa 
ttttatctga 
gttatataat 
tcagtttagt 
ttttacggaa 
atatattttt 
tcagttcaat 
gcttttttcc 
caaccctcac 
ttttctattc 
atagcttttt 
gccaaaaaca 
ccataaagta 



acatgattat 
tgtacataaa 
aatggcttta 
caaaatatat 
ttcctgtcca 
tactgctatc 
cccattgctt 
ttttcttgaa 
attcaggaat 
aattttgtct 
ccatataagt 
cttggaaagc 
ataaagtaaa 



cttaattcct 
agcattatct 
taacagatga 
tgtatcttaa 
gatctgttta 
atctaaatca 
cagaatcaga 
agtttctctt 
aattttatgt 
atatattgat 
atgtatctta 
atattttggt 
atggttgcag 



gtgaagtggc 
taatttttaa 
ctgtcaagtg 
aaatgtattt 
ttcttttaca 
gtgggccatg 
tcatagcata 
tcaaatagat 
aaattttcat 
gattaagatg 
gggtcagaac 
atctgcat tg 
agtgaacaaa 



ac tctacact 
gtctgtaaat 
aa tgagc tgt 
agactaacgt 
gtattaaatg 
gaagaaacct 
tctattc taa 
ttcaggcctc 
gctttataat 
tattttattt 
tgctgaaaga 
ctttgcatat 
aaaaaaaaaa 



<210> 50 
<211> 3086 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 2354751CB1 



<400> 50 

cgccggcgtc 

gccgcgggac 

gtggggccac 

accagatgat 

ctttgggccc 

acccgggagg 

ccgccggaac 

aacctttgcc 

gctacttcgc 

cccactggcc 

ggtgctgtat 

c tggtgt tat 

gtcatccctg 

cctgaaggag 

tggcttagag 

ggaggaaccg 

tcaggccagc 

cgtgggccgc 

ccagctccgc 

cacaccagct 

ccagatagcc 

cttcctgcat 

gcgcctcaac 

ggcctatgca 

ctacttctct 

gtttcagctg 

ggagccaggg 

tgccctctac 

ggctgagctc 

gctgctgcct 

gtttgggcgc 

gtttcgagag 

gggcgtggag 

cttccccatg 

tggctgcccc 

gggcaagctg 

ctatgagttc 

cctggcaggt 

cggaaggcat 

gcgcacactc 

gggccctgag 

cacactgcgg 



cgcccagccg 
agaagtcggt 
catttgccca 
cgtatcccct 
cctagggcct 
catggacggc 
ctggctgagt 
ctagacacgg 
ccacccgcgg 
ttgtctcagc 
gggacagtgg 
gggcggctgc 
ggccctgccc 
gttctgcccc 
catctcaacc 
caggaaccag 
attctggtga 
tatggtgaga 
ctcaaccagc 
cagcgtgacc 
gctgcctgct 
gcccccacgc 
ttcagcgggg 
gcccgaacca 
gaagaggatg 
ctgcacatc t 
cgtgcaggca 
attgtgctgg 
gtgggccgtg 
ctgcgggctc 
a tggtgggta 
gaggactact 
ggcccccggc 
ttcatggggg 
atcaagaacc 
ggccggcagg 
cagaaccagc 
gtgcgcatga 
gccctggatg 
ctgcctgtct 
gcctgcaagg 
ctgtccaaca 



acccctctgg 
cctaggcccc 
gggcctcttg 
tcctgatcca 
ttatacgcca 
tgt tccccag 
ggttcagccg 
tccacgttga 
agctggccct 
tctttaaccc 
gcacaggcaa 
cggccttcga 
cagcctccct 
tgatggctgc 
tcgacttccg 
ctgctatcat 
ccactcggcc 
tctgcggttt 
cgtactgcgg 
acctggtgca 
tcctgccgtc 
ctgctgggca 
aaaccctgga 
tgggcaagtt 
tctgtggctg 
tccgtcggga 
ccttcgtgtt 
gtttgcgcaa 
ttggggagga 
tgcctctgct 
aaagccggga 
acaacgatga 
gccacccaga 
ggcttctc-tc 
tggatgccct 
tgctgccccc 
gcttctccgc 
caccagtcaa 
aggtgaactt 
tcctgcgtgc 
acctccgaga 
acccgctgac 



gccgcggccg 
ccaggctctg 
gggctctggt 
ctggagttgg 
ccacggaagc 
cgcctctgca 
gctgcccagg 
ccctgtgatc 
ggagcatcag 
ggatgcctgt 
gagcacgctg 
gctgctcatc 
gtgccaactt 
tgctgggtcc 
gctggcaggc 
cgtcaacctg 
ctctgccatt 
ctctgatacc 
gtatgccgtt 
gatgctctcc 
ctattgctgg 
gacccttaca 
cagcactgac 
ggcctatgag 
cctggaggct 
tgccctgagg 
caccgtgccc 
gacgaccctg 
cgtcagcctg 
cttcaacctg 
ggcggtggc t 
tgttctggac 
tgagccccct 
tgcccacaac 
ggagaatgcc 
atcagagctc 
tgaggtgctc 
gtgcacagtg 
ggcctcctgc 
ccggaagc tg 
cctgttgctg 
ggaggcaggt 



acggctcccg 
accttctttc 
tttagaagag 
ccccttcaag 
tcggtagata 
actgaagc ta 
gaggagcgcc 
cgcgagagta 
ccaccccagg 

gggcgccggg 

gtgcgcaaga 
cccttctcct 
gtggcccagc 
cacctcctct 
acgggacttt 
ctgcgcaaat 
ggccgtatcc 
aacctgcaga 
ggcggttcag 
cggaacctgg 
ctcgtttgtg 
agcatctata 
ccctccaatt 
ggggtgtcct 
ggcatcagga 
tttttcctgg 
gccatgcagg 
caaaaggtgg 
gtac tgggca 
atcaaggtgg 
caggccatgg 
cagatgggcg 
gaggatgaag 
cgagctgtgc 
caggccatca 
c ttgaccacc 
age tccctgc 
gtggcagctg 
cagctagatc 
ggcttgeaac 
catgaccagt 
gttgccgtgc 



gaactgggca 

ccaggatgag 

cactccagcg 

gggagegtec 

gcgctccccc 

tacageggea 

agtttggccc 

cccctga tga 

ccgggctccc 

tgcagacagt 

tggttctgga 

gtgaggacct 

gctacacgcc 

ttgtgctcca 

gtagtgaccc 

acatgctgcc 

ccagcaagta 

agctctactt 

gtgtctc tgc 

aggggcacca 

ccaccttgca 

ccagcttcct 

tgtccctgat 

cccgcaagac 

eggaggagga 

ccccatgtgt 

aatacct ggc 

gcaaggaagt 

tcatggccaa 

ttccacgagt 

tgctggagat 

ccagtatcct 

tcttcgagct 

tagctcagct 

agaagaagct 

tcttcttcca 

gtcagctcaa 

tgctgggcag 

ctgctgggct 

tcaacagcct 

gecaaattae 

taatggaggg 



1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2704 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020. 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
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gctggcagga 
aggcctggag 
ggcgtacaac 
cccttccctg 
gactacttcc 
ctggtgcctg 
gcccaccccc 
aagccctact 
gtctgtaatc 
aagaccagtc 



aacacctcag 
ctgc tggctg 
ggtgctggtg 
gaactgctac 
ctctctcaac 
tcctttattc 
tctgtgactg 
gtctctggtc 
ccagcacttc 
tggccaacat 



tgacgcacct 
cccagctgga 
acacagcggc 
agtgagtcct 
tgtgctcctc 
ctggtttctg 
gcgcttaaac 
ccactgaatt 
ggggggccaa 
ggtgaa 



gtccctgctg 
ccgcaaccgg 
cctggccctg 
gtccctggtc 
caatcctagg 
actgagcctt 
tgcaccaatt 
ctagtatcag 
ggcgggcaca 



cacacgggcc 
cagctgcagg 
gccagagctg 
ccattgcccc 
gagtgcttct 
gtcaacctag 
ccc taccacc 
ggtcggctgt 
tcacctgagg 



ttggggacga 
agctgaacgt 
cccgggagca 
cagccct tea 
gggcctgggc 
ctctgagcca 
atttctccta 
ggtggctcac 
tcaggagttc 



2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3086 



<210> 51 

<211> 676 

<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc^feature 

<223> Incyte ID No: 2378058CB1 



<400> 51 

ttcgtttttg 

tgcttgctga 

cccagaaagg 

ggaatcagtg 

ggtgcctgcc 

ctttgtagtg 

gtcctgtttc 

aggatacttg 

atcaaaaatt 

atttaaatac 

atacacattt 

tcattttgat 



tegegegagg 
aaagctgcag 
aataaaagag 
gcttttgagg 
gagagggect 
ccacccactt 
acccagccac 
gaac taagaa 
tttcgaaatc 
cagt tttcca 
tctttttctg 
gtacaa 



ttttggtttg 
aggccgccag 
gtcagatgat 
acgtggctgt 
tgtacaggga 
ccaaaccagc 
agggagtcct 
gatatactta 
gtcaatgctg 
agtaaggagt 
tagactggta 



tgaggategg 
gagcccacgc 
gacagctgtg 
gtacttcact 
tgtgatgctg 
tttggtctct 
aagcaggaat 
cttagctaaa 
ggaagacagg 
tgatgtaaga 
gatttaatgt 



cgagtggcgc 
acctgagact 
tccttaacaa 
acgaaggaat 
gagaactatg 
catctggagc 
gac tggagag 
gcagtgttac 
agaaaagc tt 
gccaccttaa 
ttttcattca 



ccaccatctc 
tttaccttta 
ccaggcccca 
gggecatcat 
aggctgtggc 
aagggaaaga 
caggctggat 
gtagaatagt 
aattcttgac 
acgatgtcaa 
ttaaaataac 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
676 



<210> 52 

<211> 2135 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> unsure 

<222> 1913 

<223> a or g or c or t, unknown, or other 



<220> 

<221> misc__f eature 
<223> Incyte ID No: 



2595747CB1 



<400> 52 

agtcccagcc 

tgcctttatg 

tctagataca 

tgatatagat 

actttttgga 

gacttttggg 

gatgacaatt 

agtggtaaac 

ccgaccaaga 

aagccgctgg 

ctctatagtt 

tatacagatc 

ttgtgatggg 

caatgtgtgc 

aggattgaag 

tagttcccct 

attttataca 

gtcaaaaaca 



ttggaagcac 
ttacttactc 
a tagacaaaa 
acactctgtg 
gctgt tgtac 
aataaacaaa 
gaggaatttg 
c tctttcttc 
tgctgtctca 
ggttacagtg 
ggatttggct 
attgaatatg 
acagctaaca 
tacacagcat 
aaagtagtgc 
ggcaataata 
taatttagca 
taaatggcca 



actgtgtaga 
aggctcgatt 
gcacaatgga 
cagttt taga 
gctgggcaga 
aagttctagg 
cagcaggtcc 
attttactgt 
ggggaaagga 
ggacgagtga 
tgtatggatc 
agaaaaagca 
cattcagggt 
gtgeaacact 
atgagacacc 
atggcacttc 
ttataataca 
caaaaaagta 



atttctcacc 

atttgatgaa 

tgcaataagt 

gagagacaca 

agcagaatgt 

aaaagcactt 

tgetcaatet 

caaccctaaa 

atgc tgeate 

tcgaatcaga v 

tattcatggc 

aaccctggga 

catgttcaag 

caaaggtcca 

tgetgeaage 

aatagaagat 

tcttggctaa 

gtttgagtgt 



aaacatctta 
cctcagcttg 
gcagaagggt 
ctcagtattc 
cagagacaac 
tccttaatcc 
ggaattttgt 
ccccgagttg 
aatagattcc 
ttcacagtta 
cctacagatt 
cagaatgata 
gaacccatag 
gattcccact 
aagactgttt 
ggacaaattc 
ataataccat 
tatgaatatt 



gggcagataa 
ctagtctttg 
ttactgatat 
gagaaagtcg 
aattacctgt 
ggttcccact 
cagatcgtga 
aatacattga 
agcaagtaga 
atagaaggat 
atcaagtgaa 
ceggett tag 
agatcctgcc 
atggcacaaa 
ttttcttttt 
cagaaatcat 
acaatctagt 
taaaattgta 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
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agataagaaa 
taaaaacaga 
aaaaaaaagc 
ggtctt tcaa 
ttttgtttta 
ggttggatga 
aatttcagaa 
catttgtaat 
atagagaagg 
taaatggaat 
ccctgtcact 
tcccaagatt 
acattgggga 
aatttcttaa 
tagttaaatt 
gggggggaac 
atggccggtg 
tttgaggaaa 



cagtttctta 
ttttccattt 
acagcttttt 
tagtttggga 
actattttga 
aaagagatgg 
aattaagaag 
aatgctaaaa 
atattttgta 
tttgtgtaaa 
aaactgcaca 
taggatgatg 
ttaatccttt 
acaaacattg 
ccgggc ttgg 
acggt taaca 
ggtttaaaac 
tccccttttt 



gagcagatag 
tcttacaact 
caccttcatc 
attgaaagat 
aggttaggtg 
gtcagtattc 
ctgactttat 
aaggcctaat 
gtagtatagt 
ttcacaaaaa 
actatacctg 
aacggtgatg 
caacaggcaa 
cttcatgacc 
aaaagataaa 
caatttcgac 
gcggtgactg 
gcaagggggg 



aaaaatgctt 
ttaagagaaa 
ttgtataatt 
tcttgttata 
agatgggcaa 
ctacagaatt 
atttggtggt 
aaaatgccca 
aatgtgttat 
tgtgatataa 
tc tctctgtg 
aacgcttggg 
ggagaacaag 
aaatt tgata 
cgagttttca 
cattttggag 
gggaaccatg 
tgatt 



atttaaatct 
agaactgggt 
tcatagattg 
tatagctagt 
ataggcttaa 
cttattaact 
ttgaagtatc 
agaaaatatt 
gtagtacagt 
acaggatcta 

tgggggacac 

tgaacagcca 
gatt taaaat 
agggaggatt 
tgaccaacca 
ggggccggtt 
ggggtaccaa 



ttgcatgatt 
ttaatggttt 
gctgacttag 
ttgggtttgt 
ctattttgaa 
caaataacta 
ttgttgttag 
cagtgcattt 
tttaaagcta 
agactggatt 
tgctgatgat 
tcaactttaa 
tgactaacac 
canaacaggg 
ttttctgagg 
tacaatttaa 
cattggattt 



1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
16 80 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2135 



<210> 53 
<211> 1385 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 2634391CB1 



<400> 53 

gccaaccatt 

attctccaca 

aatacagtcc 

gtccagaatc 

aatctgagaa 

ctcacttgca 

agatggcctc 

acaccacaca 

tggcaaatat 

atattgctgt 

cacttcacat 

ctgccaatca 

cagactgctg 

ttcaggcgat 

actatgatgg 

aactccagag 

agagtctggt 

cttacaatgg 

tagatgc tgt 

acgaacagcc 

tgaagggaaa 

ggctagcaac 

cttatat tgg 

acagt 



ccaagtcagg 
gaaccagcat 
ttttcccata 
acagttttgc 
tattgctaat 
cagcttcagc 
tgactcttca 
gttagggaaa 
ttcccaagac 
tgcccaaggg 
gctggatatt 
gcatctcatt 
gggaagaaca 
tcagaaggga 
cctgactccc 
aaatcaacag 
tgataccatt 
caacac tgcc 
ccgcctgttg 
agtgcatttg 
gtccattcag 
actcactgtc 
caaatctaag 



ggctcccaac 
gtagagcagc 
cctccccagt 
ccaaaccaaa 
cccatgcaga 
atgatgccca 
aacacttcac 
tcactttttc 
cagtttcttt 
agaagggcac 
aaagagcaca 
gtgcaggatc 
cctctgcatg 
gcagtgggaa 
cttcactgtg 
cctcattcac 
aagtgcctaa 
ctccatgttg 
atgaggaagg 
gttcccgatg 
cagagagctc 
agttaggcag 
ttgtttctat 



aaatgataga 
agccacacta 
cccccgctta 
gcttagtttc 
cttcctccag 
gcagcgcctg 
tgccattctc 
agtggcaggt 
caaaggatgc 
tttcctatgt 
atggacagag 
tcjgtgaacat 
tgtgtgctga 
gtaatcagtt 
cagtcatagc 
ctgaagttca 
ttcaaatggg 
ctgccagctt 
gagcagaccc 
gccctgtggg 
caccgtatta 
tcctgatgta 
gacacaaaca 



ccaggcttcc 
cacccacaaa 
tgaaccaaac 
ccttcttggt 
tgttcagcag 
tgaggccatg 
aaacatggga 
ggagcaggaa 
agatggtgac 
tcttgcaaga 
tgcctttcag 
cggggcacag 
gaagggccac 
tgtggatctt 
ccacaatgct 
ggagctttta 
agcagcggtg 
gcagtatcgg 
aagtactcgg 
agaacagatc 
gctccattag 
tctgtacata 
tatttagttc 



ctgtaccagt 
ccaactc tgg 
ctctttgatg 
gatcaaaggg 
caaaatgatg 
gtggggcacg 
aatccaatga 
gaaagcaaat 
acgttccttc 
aagatgaatg 
gtggcagtgg 
gtgaacacca 
tcccaggtgc 
gaggcaacta 
gtggtccatg 
ctgaagaata 
gaagcgaagg 
ttgacacaat 
aacttggaga 
cgacgtatcc 
cttggagcct 
gaccatttgc 
actattatat 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1385 



<210> 54 

<211> 1500 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 2637522CB1 



<400> 54 

cacagagtga acaagagaga gtcatttggg 
ggatagctaa aactacgtga gcctggcgag 
agactgctat ctgggacgag acaagttgtt 



aaacaaaagg agaattttac agagagagag 60 
ggtgcagagc agaaagtaga gactgtccga 120 
aaagggacag gagagaaagc agagctattt 180 
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caagagtgag 
gtgagtgaag 
•cagtgggttt 
tcatctgccc 
acaacctgtg 
cccggggctc 
aggttgtcct 
ttatcgacat 
tcatgtgcga 
ccacctgctc 
ccaccattta 
gcaatgaccg 
acaatagccg 
tggaggagcg 
caacgcgatc 
tccctgccta 
atataccaga 
gccaagggtg 
tcccccctga 
tttgtgtcca 
ttaattttgt 
acccgggggt. 



ccacagaagg 
gccaggagca 
caagccgctg 
catctgcctg 
ccgcaaatgt 
caccactgtg 
ggacagacac 
ttacaagcag 
ggagcatgaa 
tctctgcaag 
caaacgccag 
cgtgcaagca 
gaggcagaag 
caacggtgag 
cgacgggact 
aaagcctggt 
gcccaacaga 
cccaccaatg 
ggggccccac 
cagagggggc 
aatagaaggg 
tataataccg 



gaatccagag 
gggcccaggc 
ctaggggatg 
gagatgttct 
gccaacgacg 
tcttcaggag 
ggtgtctacg 
gagtcatcca 
gaagagaaga 
gtc ttcggtg 
aagagtgagc 
gtgatcacac 
cagttgttaa 
ctgctgcagg 
cattccggtc 
ttggagggtc 
aaattgtgtg 
ggggagaacc 
tatttttttc 
cctcgggacc 
gtccaaat tt 
aggggctttc 



gccatctaag . 

caggcacgac 

cacacagcat 

ccaaaccagt 

tcttccaggc 

gccgtttccg 

gcctgcagcg 

ggccgctgca 

tcaatattta 

cccacaagga 

tcagcgatgg 

agatggagga 

accagaggtt 

cgctggcccg 

agtaatgggc 

actgtccaaa 

cggacactta 

tttgtacact 

ccacataagc 

cccctgggga 

gtgcgaaaag 

gggggaaaag 



cgaggaaggg 
caccgagggg 
ggacaacctg 
ggtgatcctg 
ctcgaatcct 
ctgcccatcg 
aaacctgcta 
ctccaaggct 
c tgcctgagc 
ctgtgaggtg 
catcgcgatg 
ggtgtgccag 
tgagagcctg 
ggaacaagca 
gaagcaactt 
tcccaagttt 
gtaaaatgcc 
ccacaggttt 
gagttgtggg 
aaagttccac 
ctccaaaaag 
gggccacccc 



tctacaggca 
atgaacttca 
gagaagcagc 
ccctgccaac 
ctatggcagt 
tgcaggcatg 
gtggagaaca 
gagcagcacc 
tgtgaggtgc 
gccccactgc 
ctggtggcag 
actatcgagg 
tgcgcagtgc 
ggacaagctt 
gggaaggccc 
ccacatttgg 
cgtccaagag 
aaaaaagtgt 
aaaaaaactt 
agaggcgctc 
tttcccaaac 
cccttttttg 



240 
300 
360 
420 
480 
540 
.600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
12 60 
1320 
1380 
1440 
1500 



<210> 55 
<211> 2051 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte ID No: 2650980CB1 



<400> 55 

gcggacgccg aggctcgcgt. tccgccgctg 
ccctgccgag cgtgaccccg aactgtcacc 
ccgcggttga ttcttcacca cactgaaacc 
aggcttcaga gtgtaacctg tactcgggcc 
tctcaaggtg aactcgtcca tcctaaggca 
cacgcggaca agttaggtga gaaggtagaa 
aattctaccc gggaaactcc tcccaaaagc 
gaaccagaca taagttcaga ggaatctgtc 
ccacctgcta cttcgagtga ggcagagcag 
gaagaaaata agtcttctga ggaaaccaaa 
gagaagaaag tgaaaaaaac aattccttcc 
agggcccaga aacaaacaec gatggcttct 
actgaggcca ttaaggcatg cttccagaag 
tacatcatcc ' ataagtatcc ttctctggag 
gcactgaaaa gagaattaaa tagaggagtc 
ggaagttttg ttgtggttca gaaatcaaga 
aataggagct ctgcagtgga tccagaacca 
gcctttactc gcctttgtga acctaaagaa 
tctcagtatt atcctaagct ' tagagtggac 
cagagagcag tagagagggg ccagttagaa 
ttccagctga agaaatcagg ggagaaaccc 
atcttgtctg ccattgctgc catgaatgag 
aagtatgtcc tagagaatca cccaggaacc 
aaaaccctgc agaaatgcga aaagaatggg 
agtggcacct tccagctctg ttttccctat 
aaagagccag atgattctag agatgaggat 
tctgaggatg aagagccgcc acctaagaga 
ccagggaagg ccgcatctgt gaagcagaga 
gcccagcggg ggaaagctag gcccttgcct 
gccaagaaga ccagaccctc atccacagtc 
aagcctgcaa ccagtgcaag aaaggaagta 
aagaagtctt tcagagtgaa aaagtaaatt 
tcaaaatctt attttctaag gtcagtgtgc 
cattattttc ctcccctatg aacattgtgg 



cttccgtcgc cggggcgggc gccgcggtgt 60 
gcgccgagcc ccagcctcat ccgtcggtgt 120 
attaggaaaa atccttgtgg ttaacagcag 180 
tagaaattat ttaaaatggc gactgatacg 240 
ctcccactta tagtaggagc tcagctgatc 300 
gatagcacca tgccgattcg tcgaactgtg 360 
aagcttgctg aaggggagga agaaaagcca 420 
tccactgtag aagaacaaga gaatgaaact 4 80 
ccaaaggggg aacctgagaa tgaagagaag 540 
aaggatgaga aagatcagtc taaagaaaag 600 
tgggctaccc tttctgccag ccagctagcc 660 
tccccacgtc ccaagatgga tgcaatctta 720 
agtggtgcat cagtggttgc tattcgaaaa 780 
ctggagagaa ggggttatct ccttaaacaa 840 
atcaaacagg ttaaaggaaa aggtgcttct 900 
aaaacacctc agaaatccag aaacagaaag 960 
caagtaaaat tggaggatgt cctcccactg 1020 
gcttcctaca gtctcatcag gaaatatgtg 1080 
atcaggcctc agctgttgaa gaacgctctg 1140 
cagataactg gcaaaggtgc ttcggggaca 1200 
ctgcttggtg gaagcctgat ggaatatgca 1260 
ccgaagacct gctctaccac tgctctgaag 1320 
aattctaact atcaaatgca tttgctgaaa 1380 
tggatggaac agatctctgg gaaagggttc 1440 
tatcccagcc caggagttct gtttccgaag 1500 
gaagatgaag atgagtcatc agaagaagac 1560 
aggttgcaga agaaaacccc agccaagtcc 1620 
gggtccaaac ctgcacctaa agtctcagct 1680 
aagaaagcac ctcctaaggc caaaacgcct 1740 
atcaagaaac ctagtggtgg ctcctcaaag 1800 
aaattgccgg gcaagggcaa atccaccatg 1860 
ttataggaaa aaagggtatc atgatgaaat 1920 
atttgtttag ttttgatgct tttcaaatta 1980 
ggagggactc taaataaacc agtttaggca 2040 
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aaaaaaaaaa a 2051 

<210> 56 

<211> 1694 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> unsure 
<222> 239 

<22 3> a or g or c or t, unknown, or. other 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 2939607CB1 



<400> 56 

gaattttgtg tttgcatcgt catcatgcgc gtgttgagcc ctttagtaat gtttttgtgt 60 

attagcttct tatttctagg tagacggtaa tttcgtcaat gaaaaaattt aacggtgacc 120 

gtcacccctt gcctcagtca ctttctgtcc ctcgcgcggc cccggggttc accgcggacg 180 

ttcctgttct gtggagaata acggctgcac ctgtatattg acgcatattt ggcggcggng 240 

gtgctttgtc tcctcagcac tctgctgtca ctcaaggaag tatcatcaag aacaaggagg 300 

gcatggatgc taagtcacta actgcctggt cccggacact ggtgaccttc aaggatgtat 3 60 

ttgtggactt caccagggag gagtggaagc tgctggacac tgctcagcag atcgtgtaca 420 

gaaatgtgat gctggagaac tataagaacc tggtttcctt gggttatcag cttactaagc 480 

cagatgtgat cctccggttg gagaagggag aagagccctg gctggtggag agagaaattc 540 

accaagagac ccatcctgat tcagagactg catttgaaat caaatcatca gtttccagca 600 

ggagcatttt taaagataag caatcctgtg acattaaaat ggaaggaatg gcaaggaatg 660 

atctctggta tttgtcatta gaagaagtct ggaaatgtag agaccagtta gacaagtatc 720 

aggaaaaccc. agagagacat ttgaggcaag tggcattcac ccaaaagaaa gtacttactc 780 

aggagagagt ctctgaaagt ggtaaatatg ggggaaactg tcttcttcct gctcagctag 840 

tactgagaga gtatttccat aaacgtgact cacatactaa aagtttaaaa catgatttag 900 

ttcttaatgg tcatcaggac agttgtgcaa gtaacagtaa tgaatgtggt caaactttct 960 

gtcaaaacat tcaccttatt cagtttgcaa gaactcacac aggtgataaa tcctacaaat 1020 

gccctgataa tgacaactct cttactcatg gttcatctct tggtatatca aagggcatac 1080 

atagagagaa accctatgaa tgtaaggaat gtggaaaatt cttcagctgg cgctctaatc 1140 

ttactaggca tcagcttatt catactggag aaaaacccta tgagtgtaaa gaatgtggaa 1200 

agtcttacag ccagagatct caccttgttg tgcatcatag aattcacact ggactaaaac 1260 

cttttgagtg taaggattgt ggaaaatgtt ttagtcgaag ctctcacctt tattcacatc 1320 

aaagaaccca cactggagag aaaccatatg agtgtcatga ttgtggaaaa tctttcagcc 1380 

agagttctgc ccttattgtg catcagagga tacacactgg agagaaacca tatgaatgct 1440 

gtcagtgtgg gaaagccttc atccggaaga atgacctcat taagcaccag agaattcatg 1500 

ttggagaaga gacctataaa tgtaatcaat gtggcattat cttcagccag aactctccat 1560 

ttatagttca tcaaatagct cacactggag agcagttctt aacatgcaat caatgtggga 1620 

cagcgcttgt taatacctct aaccttattg gataccagac aaatcatatt agagaaaaag 1680 
cttactaata aata 1694 

<210> 57 

<211> 1177 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_ feature 

<223> Incyte ID No: 3098421CB1 



<400> 57 

gacctggccc ccgaatcagg aaaaggcagg 

agccgcttag tattctgact ctattctgcc 

aagaagaaaa ttcccaacat ggacaaacca 

gcgcccaaga ccgatgagga gaggcctccg 

gaggagcagt cttcggagga gcagtcctcg 

cctgagctcc tgcctgagat gctcctctcg 

aggaaggacc tgtttgaggg gcgccctccc 

cataagcttg aagaaggaag ctttaaagaa 

ggggacatac atggcagaaa tttaagcaat 

gaagagatga aaagagtaag aaacaaactg 



tattggaacc cacggcctgg gtgttagtcc 60 

tttcttgtct cctaagaata actgtgcttg 120 

cgcaaagaaa atgaagaaga gccgcagagc 180 

gtggagcact ctcccgaaaa gcagtccccc 240 

gaggaggagt tctttcctga ggagctcttg 300 

gaggagcgcc ctccgcagga gggtctttcc 3 60 

atggagcagc ctccttgtgg agtaggaaaa 420 

aggttggctc gttctcgccc gcaatttaga 480 

gaggagatga tacaggcagc agatgagcta 540 

atgataatgc actggaaggc aaaacggagc 600 
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cgtccttatc 
ccattgccac 
ttaacttttt 
ttccccctaa 
agatgtacat 
ccatttttgt 
tagacttcaa 
tgttttatct 
ataatgaaaa 
gcacttaata 



ctatttaatg 
ctggactttc 
gtgaggcttt 
aatctacaag 
tatatattgt 
aaaaaaaatg 
aggcataaca 
gaatttctca 
aatcacacaa 
aatacttgtc 



tgttcggcct 
tgtttgcatt 
attttagatg 
tttccctctt 
gaaacgaaaa 
tatatttata 
gtggttgtgt 
ttttttcagg 
ttttgaagaa 
agaactttaa 



<210> 58 

<211> 1219 

<212> DNA 

<213> Homo sapiens 



ttaattctgt 
ttcttaatgc 
tttagcatgt 
tcagtcatga 
aagcaatttt 
tattaatatg 
ggtaagataa 
acaaacgttt 
aactgtcaat 
aaaaaaa 



tttgcctgct 
cttttcccat 
aactcgctta 
gccctacaca 
caaatggtat 
caaagaaaaa 
taggtgattt 
tacttgtgtt 
cagcttataa 



aatagta ttg 
attctgaatt 
aagttgaggt 
tttgcatgaa 
atattgtatc 
gctaaaagta 
tttaaatttt 
gcaaaaatat 
cgacaatgtg 



660 
720 
780 
840 
900 
960 
1020 
-1080 
1140 
1177 



<220> 

<221> unsure 
<222> 1139 

<223> a or g or c or t, unknown, or other 



<220> 

<221> misc_feature 

<223> Incyte ID No: 3296650CB1 

<400> 58 

gctcggaatt cggctcgagg tagccagtgc 
tctctgggat cccagccctt acacgataga 
gggtgctgca gagaagatgc aatggtagct 
ctgaaaatgg aagatgtggc cctgaccctc 
caggtgaacc tctacagaga tgaaaagcag 
ggtgaaatac agactaagag cagggacttg 
catgggaaga tatgccacct gagggaagac 
ggtgaacagg agggcaggtt acaaagaaag 
tattgccatg aatgtggaaa gagttttgct 
atccacactg gtgagaaacc ctatgaatgt 
tctgcccttg tcattcatca gagagtccac 
tgtggtaagg tcttcagtca cagctcaaac 
gagaagccct atgagtgtga tgactgtggg 
gaacatcaca aaattcatac tggggagaag 
tttaggcgga attcacatct cctgagacat 
aatcctgagc acggggagtc ctgggaaagt 
actgaggctc ccgtgtctta taaatgtaat 
agtcttattg aacatcagaa aatccacact 
ggaaaaggtt . tcactcgaac ttcatacctt 
aaaactcttt cacagtgacc catggttatc 
actgaagcaa cccttggag 

<210> 59 

<211> 1309 

<212> DNA 

<213> Homo sapiens 



cagccaatga aggctctgtt caagcatgaa 60 

gttctccagg ttcctgggct tgcccaggga 120 

tccaggctca ctccagggtc ccagggtttg 180 

actcctgggt ggacacagct ggattcatct 240 

gagaaccata gcagcctggt ctcccttggt 300 

cctccagtca agaagcttcc agaaaaggag 360 

attgcccaga ttcctacaca tgcagaagct 420 

cagaaaaatg ccatagggag taggcgacat 480 

caaagttcag gcctgactaa acacaggaga 540 

gaagactgtg gaaagacctt cattgggagc 600 

actggtgaga agccatatga gtgtgaagaa 660 

cttatcaaac accagagaac ccacactggg 720 

aagaccttca gccagagctg cagcctcctt 780 

ccataccagt gcaatatgtg tggcaaagcc 840 

cagaggattc atggtgataa aaatgttcag 900 

cagggtagga cggaaagcca gtgggaaaat 960 

gagtgtgaga gaagtttcac acggaataga 1020 

ggtgacaaac cctatcagtg tgacacatgt 1080 

gttcaacatc agagaagcca tgtagggana 1140 

atgccaacat tggtgctcat ttgtcactga 1200 

1219 



<220> 

<221> misc.feature 

<223> Incyte ID No: 3687719CB1 



<400> 59 

ctggaagctg 

agcagcctct 

acatcgccct 

gcatctaccg 

agaacagcat 

accgcaagcc 

agcacggcag 

cccggggccc 

tccccaacgc 



agcctgaccc 
gcccgggcct 
tattgctatg 
ctacatcatg 
ccgccacaat 
aggcaagggc 
c t tcctacgc 
cgccaaggca 
cacgaccggc 



•agcaggtccc 
ggcgccccca 
gccatccaga 
ggccgattcg 
ctgtcactca 
agctactgga 
cgccgccgcc 
cgccgtggac 
aggcagtgct 



agcagcc tgg 
caactgagcc 
gctcaccggg 
ccttctaccg 
acgagtgctt 
cgctggaccc 
gcttcacccg 
ccctcagggc 
cattcccacc 



cagcccggcc 
aaccaagcct 
gcagcgggcc 
ccacaaccgg 
tgtcaaggtg 
tgactgccac 
gcagacaggt 
gaccagccag 
agagctgcca 



agcatgcagc 
ccctacagct 
accctcagtg 
cccggc tggc 
ccccgcgatg 
gacatgtttg 
gctgagggca 
gacccaggag 
gatcccaagg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
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gcctaagctt 
atggcaggcc 
caggggagct 
gcttctcaga 
gcatcgggag 
caggccttga 
gctcactccg 
gcttccctgt 
cgccaggaat 
ctcaccctcc 
ggtgagccaa 
ggggcctccg 
gctctcaggg 



tgggggtctg 
tcggccaccc 
ccccgtggcc 
ggctgagagt 
cagctaccag 
gcacctcttg 
ggccccactg 
ccagggaggc 
gttcttcttt 
gggctgagcc 
ggacaaccac 
gacaaacttg 
ccaataaagc 



gtgggggcca 
atggagccca 
acctcatctt 
tttaa taagg 
tgtcggctgc 
gcctcagcag 
cccctgccaa 
tccggctacc 
gagtaaaggc 
tggctctagg 
acagaaagcc 
ggggtgaggg 
catgtgatga 



tgccagccag 
aagagatttc 
cctcatgccc 
cccc tacgcc 
aggcactgaa 
ccccctcccc 
ctgaccacaa 
cattggggct 
agcctcacct 
acctgaagaa 
aggattgaag 
gaagcagggc 
tgaaaaaaaa 



tatgtgccca 
cacgcccaag 
agcgtttggc 
cgtcttgtcc 
tttttgcatg 
tgcaccaccc 
ggaaccctgg 
gaccccctgc 
cgggcagtcc 
etc tcgaagg 
cggtgctcag 
etc ttgggat 
aaaaaaagg 



gcaaccactg 
cctgcatgcc 
tttcctgccg 
ccggaatcag 
ggggctgacc 
acccctccag 
gttgcaggtg 
etatacegga 
ctgcaggtcc 
acccagccc t 
ccaggcccct 
ttactctgtg 



600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1309 



<210> 60 

<211> 1326 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_f eature 
<223> Incyte ID No: 



3774188CB1 



<400> 60 

gaggagctca 

agagegaacg 

cgggggtggt 

gtccggacca 

accgegggaa 

cagggagaag 

tcaaagtgaa 

gggaaccatc 

gccattccca 

gggatcgaga 

ggcgacggga 

tggttcctcg 

gctttgaact 

aatatagtga 

aaaatgatga 

gacaccgccg 

tctttcaata 

agccctacat 

agecacagag 

gagaatacgt 

aggatgagga 

gatacaeggt 

ccagca 



actgatctgt 
ggagcgaggg 
ggtgaagcag 
ctccggtggt 
ccgagcccga 
aagcaagtct 
geaggagegt 
agaacaggaa 
ccaaaggaga 
cactcagaac 
gcatcgccag 
gcctggtggc 
ttctggggca 
gcccccagaa 
ggtgcttcca 
cattgeagae 
tcggcttgtg 
cattgacctt 
atactatgaa 
cttgctccat 
ggaggaggaa 
ttccttcttg 



tttctttege 
ageeggegaa 
gagegtctea 
agcccgtc tc 
ggagttagcc 
cctcgcagta 
gaggatcatc 
cacaggagag 
aegtctaacg 
ctgcaggctc 
aggaatgacg 
aacaacaaag 
cttcttgagg 
gcacgtatcc 
gtcatgtaca 
attccaattg 
gaatataccc 
ggctcaggca 
ctaaaagaaa 
gagtegtegg 
gaagtgtctg 
gaagtctttg 



ccagccaaaa 
gaeaceggga 
geccagaagt 
cgccgaccag 
ggtccccacc 
agagaaaccg 
cceggagagg 
ctaggaacag 
agaggectgg 
aggaagaaga 
ttggtggtgg 
aaaaagaggt 
acaccaacac 
ccaaaaaacg 
tacatcgaca 
atcacccgtc 
gtgctgatgg 
atggaacctt 
aggatgtact 
acac ttctga 
acagctagca 
attgactcag 



tcacagaatg 
eggggaegtg 
cgcacctccc 
cgagccggcc 
caaaaagaaa 
aagtcctcac 
aegggaggat 
tgacegggae 
gagtgggcag 
gcgggagttt 
eggcagtgag 
gcccgctaaa 
tttccggggt 
gtggcgtctc 
gagtgegtae 
ttgttcaaag 
cacagt tggc 
cttaaacaac 
caaatttgga 
aatagacagg 
aactaagaac 
agagcactat 



aaggcggtga 
gtgetgeegg 
gcccaccgcc 
cgctcgggcc 
aacaaggect 
cactcaacag 
cggcagcaca 
agacaceggg 
ggtcagggac 
tataatgeca 
tctcaggagt 
gaaaaaccaa 
gtagtca tta 
tacccattta 
ctactgggtc 
cagcatgegg 
cgaagagtga 
aaacgtattg 
ttcagtagca 
aaagatgacg 
ccaaactatt 
ggtggtgggt 



60 
120 
180 
240 
300 
360 
420 • 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080. 
1140 
1200 
1260 
1320 
1326 



<210> 61 

<211> 1097 

<212> DNA 

< 2 1 3 > H omo s ap i ens 

<220> 

<221> misc__f eature 

<223> Incyte ID No: 4349106CB1 

<400> 61 

gcggcttccg ggatttggcg gtggcctttg 
caggggctcc cagtccttcc atctgggagg 
agaacctctg ttactctgtg accggcaggc 
catcccggaa gctgggaaat gggactgttg 
egggaggagt gggaacacct ggactcagat 
gagaactacg gaaacctggt ctctctgggt 
tttttggagc aaaggaaaga gccctggaat 
ccagatatct tttctcatga tactcaaggc 
ttccaaaaag tgatattgga tggatatggg 



ttggctgcag taagagctca gtctcttcac 60 

ccaaggcggc ttcgcgttct gagaatagac 120 

acegggagat ccgtagctca gaegecagga 180 

acattcaggg atgtggccat agaattctct 240 

cagaagcttt tatatgggga tgtgatgtta 300 

ctcgctgtct etaagcegga cctgatcacc 3 60 

gtgaagagtg cagagacagt agccatccag 420 

ctcttaagaa agaagcttat agaagcatca 480 

agctgtggcc ctcagaattt aaacttaagg 540 
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aaagagtggg 
tctgcatgag 
tcgaaaaggg 
tatgccttgc 
gggtagggaa 
ctgggggcag 
atattcgttt 
aatttatttt 
tataatccaa 
ttaaatatta 



aaagtgaggg 
aaaaaggacc 
gtgaagttta 
aatgtatgta 
agggttgaag 
gtctatatat 
acaatgttag 
ctgggtagga 
aaagctttac 
aaaaaaa 



caaaataatc 
aaagacaagt 
tatgggtcta 
catcttcaca 
tatagatgaa 
tttattttta 
ctcaggatct 
gtaaagctgg 
atgaatgttt 



ctatggtgaa 
gaattttctg 
tttatttgtc 
aaaaaaatct 
gcagaagtgg 
tgtctttaat 
tgtttacctt 
ttctattcct 
ctgtatgtaa 



aaaaaatcaa 
agggtgatga 
taacatgtac 
taaaaaaatt 
tacatgatta 
agcatttgta 
tggacaggga 
cagttatgta 
tgtggtatat 



caagataatc 
aagtgttgct 
agttaaggtt 
aaatgggtgg 
gtagttgaag 
taaatgtaca 
gggagggaga 
aattatctgt 
caataaaaaa 



600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1097 



<210> 62 

<211> 2404 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte ID No: 4834217CB1 



<400> 62 

ctcgctggct agtaggagag actggtgctt 
ttaaataaaa agtcgaggac acggcggtcg 
ccatttgctc cctggaggcc ctcgcgtctt 
cggtgaggga gcccggaacg attccttcgc 
ctttgtggga cggaccctgg cgggccctgc 
cgatttgggg ctgggggccg ccgtccccgt 
cgtggagtac ccgggagtgg tgcgtgatgt 
ggaaggcgtc tcccggatct acgcagaccc 
caaggaccca tactgccacc cagtgtgcgc 
ccgcatcagg aagagaacga ggcggcagaa 
ggtcacattt gacatggaga tccttggcat 
gtctgacttc cagtacttgg ctgtgcatac 
tgacaaggtg ctcatgctcc ggcccgagaa 
ctacatcccc ccacccatct tctcccggct 
agagacccag caccgggaag gctacaacaa 
cctgagcaga gcccggcgcc cccacaatgc 
gcccaagcag ccactggagg ctgcagccca 
ggaccggaag gtggaggagg agctgaggaa 
aaatgctgtc aaggccaaca tcagcgtcca 
catagcctat tacatgataa caggcccctg 
cccccgaaaa aacccagatg ccaagattta 
aatgaaacac ggttacgccc ccagtgactt 
ctacagcctc cccatcaccg tcaagaagac 
gaagcagggc ctgggcccgt cggggacgag 
caagctcaag gactctgtct acatcttccg 
gttctaccag ttatgcgact tgaatgtgga 
cggggcagag aattcctgca cagaacggga 
gctcagggac accatgtccc tcatgatccg 
cttttccagc tcagccaagg ctgatggcgg 
agacgaggag gatgaggagg aggaggaaga 
cagtgaaaac gaaatggaga cagagattct 
cctccctgac ccggccagac tggtgtctgg 
cacccacagt gcccggaatg gccctaggag 
agggtgcagc tgaccctagc actggctgtg 
aggggttagg gacatcccca aagggtatac 
catccagcca gtgagtgggc acccaatgcc 
tggagctggg cagctgtgga gccccaggcc 
gtggtggtgg agatagcgtg gggagcctcg 
tagctctgga tcccaaccgt ttggcacagc 
tccttattaa accagtttcc cgagaaaaaa 
aaaa 



gccccgcccg gtggactaac tcgcttaatt 60 
ttttcccgaa gacatgggcc ctcccatggg 120 
gctgagcccg gggagttagg atgacgcgag 180 
ggaacaattg aggcgaggcc tttgggagta 2 40 
cagacgcaca gggatggcgg cggaggcggc 3 00 
ggagctgagg cgggagcgac gcatggtgtg 3 60 
ggctaagatg ctgccgactc tgggcggcga 42 0 
caccaagagg ctggagctgt acttccggcc 480 
caaccgcttc agtaccagca gcctgctgct 540 
aggggtgctg ggcactgagg cccactccga 6 00 
catctccacc atttacaaat ttcaggggat 660 
ggaagcaggc ggcaagcata cgtcaatgta 720 
ggaggccttt ttccaccagg agctgccgct 780 
ggacgccccg gtggactact tctaccgacc 840 
tccccccatc tcaggtgaga atctgattgg 900 
catctttgtc aactttgagg atgaggaggt 960 
gacgtggagg agagtctgca ctaaccccgt 1020- 
gctgtttgac atccgtccca tctggtcccg 1080> 
cccagacaag ctcaaggtct tgcttccctt 114 0 
gcgcagccta tggattcgat ttgggtatga 1200 
tcaagtcctc gatttccgaa tccgttgtgg 1260 
gccggtcaaa gcaaagcgca gcacctacaa 1320 
atccagccag cttgtcacca tgcatgacct 1380 
tggtgctcgg aaaccagctt ccagcaagta 1440 
ggaaggggcc ttgccaccct atcggcagat 1500 
agagttgcag aagatcattc accgcaatga 1560 
tgggtggtgc ctccccaaga ccagcgacga 1620 
gcagaccatc cgctccaaga ggcctgctct 1680 
aaaagagcag ctgacgtacg agtctgggga 1740 
ggaggaggag gacttcaagc catccgacgg 1800 
ggactacgtg tgacagggcc caaggctggg 1860 
cctaatgagg gagccggggc tccccattgc 1920 
gccctctgag gagagctaga gtcccagcaa 1980 
acatgctgct tggtgctgcc tctggtcctg 2040 
cctggctctg ccacccatga accagcccag 2100 
tctcaggatg agaccagtaa atgccggagg 2160 
acaggccagt ctcgcttggc tctcatgact 2220 
cccatggtct cacgtggcaa gaagtgcctt 2280 
tttggccaca gccaggcccc tctggaattg 2340 
aagaataaaa acagcagaaa aaataaaaaa 2400 

2404 



<210> 63 

<211> 1900 

<212> DNA 

<213> Homo sapiens 
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<220> 

<221> misc_feature 

<223> -Incyte ID No: 5156094CB1 



<400> 63 

agctggagca ccgtgaggaa 
acagaggagg aaatgatttt 
gccccaatgg ttcgggtagg 
gacattgttt actgtgagga 
aatgaggtgc tcagcacagt 
tgtgaaagag agcagaacag 
cttgctgtgg ccaggcttgt 
ccaaaacaat attccaccaa 
attgagaaga tcctcagcac 
cgcatcctgc catcgctaga 
attgctgcca tcgcagttca 
tgtgaagtca tcaaagccat 
tctcatgacc acatccaaca 
tcttccgtga tggtggcccg 
ctgcggcccc tggaggaggt 
cactacacca acaccaagta 
cagggaaggt tgctccatgc 
ggtgccttct atgaggagac 
aagacttcag agcagacagg 
gtcaagtttg accggagagc 
tggtgccgga gggagaagtt 
cgcctgttct cctctattgt 
aagtccaaga aactggcgga 
cctgagggtc ggctgggtga 
gaccaagacc ctgggggccc 
aagccctttg tggccttggg 
ttcctgcctt agtcacccct 
tgaaccagat gccgttgaac 
aggggccatc agcctagagc 
ggtggatctg agtgacaggg 
tccccaacct gacattggta 
ctgcttcagc cttgggcatc 



gaagcgaggt tctttttaag 
gaatagcctc tctc tgtgt t 
gactcttcca atgaggctgc 
gctgatcgac ctcaagatga 
ggactttgtc gcccctgatg 
ggtggtcttc cagatgggga 
agaaaatgat gtggctggta 
gggaggaatg ggagctgccc 
tcttgttaaa gggacacgca 
agataccctg agccttgtga 
tgggaggaag cgggaggagc 
tgctgatacc ctctccattc 
gtattcggac atagaggact 
agcagccatg tggaacccat 
catgcagaaa tacatcagat 
ctgcttgtgc cagatgctac 
tgcccagtct tcccgggaaa 
cacacaggag ctggatgccc 
ggagccagct gaagatacct 
atacccagcc cagatcaccc 
ggcacagcct gtgtatgaaa 
caccgttgct gaa'caaaagt 
gcaggctgca gccatcgtct 
ggagagccct tccttgcaca 
cagagctcag gagctagcac 
aagtggtgaa gaaagccccc 
ccatgggcct ggtgctaagg 
agtttgctgg tcttgcctgg 
atggaccagg ggccgcccag 
tcaagttctc tttgaaaaca 
ctgtgcaata" aagacacccc 
ttcataaaaa aaaaaaaaaa 



agttcagctg cgaggctgta 60 

accataataa gctaatcctg 120 

tggccctgga ttatggagcg 180 

ttcagtgcaa gagagttgtt 240 

atcgagttgt cttccgcacc 300 

cttcagacgc agagcgagcc 3 60 

ttgatgtcaa catgggctgt 420 

tgctgtcaga ccctgacaag 480 

gacctgtgac ctgcaagatt 540 

agcggataga gaggactggc 600 

gacctcagca tcctgtcagc 660 

ctgtcatagc caacggagga 720 

ttcgacaagc cacggcagcc 780 

ctatcttcct caaggagggt 840 

acgcggtgca gtatgacaac 900 

gagaacagct ggagtcgccc 960 

tttgtgaggc ctttggcctt 1020 

agcaggccag gctctcagcc 1080 

ctggtgtcat taagatggct 1140 

ctaagatgtg cctactagag 1200 

cggttcaacg ccctctagat 1260 

atcagtctac cttgtgggac 1320 

gtctgcggag ccagggcctc 1380 

agcgaaagag ggaggctcct 1440 

aacctgggga tctgtgcaag 1500 

tggaaggctg gtgactactc 1560 

tggctgtgga tgccacagca 1620 

cagaagttag atgtcctggc 1680 

gggtggatcc tggccccttt 1740 

ggagcttttc aggtggtaac 1800 

ctaccctcac ccacggctgg 1860 

1900 



<210> 64 
<211> 2901 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc^f eature 

<223> Incyte ID No: 5665139CB1 

<400> 64 

accacagcgc agtctcactc tgtccttcag 
actgcaacct ctgtctcctg ggttcaagtg 
agatacagga ttgacttcta aagactcttg 
aggaaagcaa aggagtcggg gatggctctt 
gccatagaat tctctcagga ggagtggaaa 
agagacgtga tgctggagaa ttataggaac 
ggatgtggcc atagaattct ctcaggagga 
tctatacaga gatgtgatgt- tggagaatta 
taaatgtgag atcaaggaat taccaccaaa 
gacagtaatg ttggaaagac atgaaagcca 
ccagaaaaat gtacatgact ctcagtgtct 
agtgcgtgtg acctataagg aaagtctcat 
tgatgcacaa aagcagcctg ttaaaaatca 
agaactgcag ctatttcaag ctgaagggaa 
tgtcaacagt agttccttag tttccccacc 
catttctcat acatatgaat gtaattttgt 
aaatattggg acagaacact acaaatgtaa 
acattttact atacatcaaa taatccatac 
tggcaagatc ttcaataaaa aatcaaacct 



gctgtgagtg cagtctcatg atcttggctc 60 
attcttgtgc ctcagcctac caagtagctg 120 
gtacctgagg aagaaaccca gaagaggaag 180 
tctcagggtc tattgacatt cagggacgtg 240 
tgcctggacc ctgctcagag gactctatac 300 
ctggagtctg tggggtctat tgacattcag 360 
gtggaaatgc ctggaccctg ctcagaggac 420 
taggaacctg gtctccctgg gtattttttc 480 
aaaggagagt aatacaggag aaatattcca 540 
cgacatacaa gatttttgct tcagagaaac 600 
gtggaaacat gattgaagac attataagcg 660 
tggtagaaga gacatgcatg gtagaaagga 720 
gcttggatta aacccgcagt cacatctacc 780 
aatatataaa tatgatcaca tggaaaaatc ' 840 
ccaacgtatt tcttctactg tcaaaaccca 900 
ggattcatta ttcacacaaa aagagaaagc 960 
tgagcgtggc aaggcctttc atcaaggctt 1020 
taaagagacg caatttaaat gtgatatatg 1080 
tgcaagtcat caaagaattc atactggaga 1140 
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gaagccatat 
gcatcgcagg 
taatcaaatt 
atgtaatgaa 
tcatactgga 
ctaccttgta 
tggaaaggtc 
gaaaccttac 
tcactggaga 
cagtcacaag 
atgtaatgaa 
tcatgccggt 
acatcttgta 
tggcaaagtc 
aaaagct tat 
tcatcggaaa 
tagtatgcgt 
atgtaatgaa 
tcatgccgga 
aagccttgta 
tagcagagcc 
gaaactttac 
tttcaaagaa 
tggcacaatt 
ctgagagtgg 
agagcattta 
atcaaaaac t 
ggcatcaaaa 
aataagcctt 
aggcaaatgt 



aaatgtaatg 
attcatactg 
tcacaccttg 
tgtggaaagg 
gaaaaacctt 
caacatctga 
ttccatcata 
aaatgtaatg 
attcatactg 
tcatccctag 
tgtggcaagg 
gagaaacctt 
caacatcaca 
ttcagtcaaa 
aaatgtaatg 
attcacactg 
tcaagcctca 
tgtggcaaac 
aagaaatcta 
agacatacaa 
tttagttttt 
taatgtactg 
ttcttgctgg 
ctcacattgt 
gaaaccat ta 
taataattat 
tttttgatga 
ccggaacatc 
tcagtgtaat 
aaaaaaaaaa 



aatgtggcaa 
gagagaaacc 
cacaacatca 
tcttccatca 
acgaatgtaa 
tcattcatac 
tttcacacct 
agtgtggcaa 
gagagaaacc 
taaatcactg 
tcttcagtcg 
ataagtgtga 
gaatccatac 
attcatacct 
aatgtgggaa 
gagagaaacc 
ctaatcatca 
tcttccgcga 
acacatgtaa 
taatttatac 
gttcaaggct 
aatgtggcaa 
tgagaaacct 
acaatattgc 
tgacttcaaa 
atgtgataga 
aaccatacaa 
acagggttta 
attcatgatt 
a 



ggtcttccat 
atataaatgt 
aagaattcat- 
aatt tcacac 
caaatgtggc 
tggagagaaa 
tgcacaacat 
ggtcttcagt 
ttacaaatgt 
gagaatccac 
caattcatac 
tgaatgtgac 
tggagagaaa 
tgcatatcat 
ggtcttcggt 
tttcaaatgt 
tgcgatccac 
caattcatat 
ttaatgtggc 
tggagaaaaa 
taataaccgt 
ggtcttaagg 
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